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EDITORIAL 


Museum ethics—more than lip service 


n the past few years, museums around the world 
have started to grapple with questions about the 
origins and ethics of their collections. This in- 
cludes the acquisition and maintenance of natu- 
ral history specimens. As museums examine their 
missions and processes, it seemed like a good time 
to talk to Sean Decatur, the new president of the 
American Museum of Natural History (AMNH) in New 
York City. He spoke with me (see the full conversation 
here) about the museum’s research and how collabora- 
tions between museums and partner nations should 
aspire to assemble collections that 
responsibly disseminate informa- 
tion about human cultures, the 
natural world, and the Universe. 

Last December, Decatur tran- 
sitioned from university admin- 
istration (as president of Kenyon 
College in Ohio for 9 years) to mu- 
seum leadership. He makes a strong 
case that natural history collections 
are critical for science. The AMNH’s 
roughly 34 million specimens and 
objects, especially the biological 
collection, “tells the story of life on 
the planet,” he said. “You can really 
follow changes that have happened 
in species that might have been 
collected from the same geogra- 
phy over the course of more than a 
century now.’ But Decatur is sensitive to how museums 
must responsibly build and keep such massive collec- 
tions. He acknowledged that standards have changed 
over time “not only in terms of agreements about own- 
ership of items being collected, but collaborations that 
involve the engagement of students and faculty from 
the partner nations and institutions where we're col- 
lecting.” He boils it down to creating “a true two-way 
collaboration with students moving in both directions 
along the development of a project.” 

Determining the proper home for objects from Indig- 
enous groups in the United States is governed by the 
Native American Graves Protection and Repatriation 
Act of 1990. I asked Decatur how the museum views 
compliance with these regulations when questions are 
raised about whether objects or specimens should be in 


“There's more 
to returning items 
as a repatriation 


process than 
just putting them 
in the mail.” 


Sean Decatur, 
American Museum of Natural History 


New York. For items that belong to North American In- 
digenous groups, he emphasized that a clear process ex- 
ists for repatriation and that the museum has resources 
to work with Indigenous groups who claim ownership 
of objects that are in the museum’s collection. But he 
also wants to make sure that commitments are “more 
than lip service” by ensuring that the museum returns 
items that are not now, and were not in the past, col- 
lected under terms that do not meet today’s ethical 
standards. Moreover, Decatur is focused on building ful- 
some partnerships devoted to healing and moving for- 
ward from the past. “There’s more 
to returning items as a repatriation 
process than just putting them in 
the mail,” he said. 

Earlier this year in May, the 
AMNH opened the Richard Gilder 
Center for Science, Education, and 
Innovation, which includes class- 
rooms and programs aimed at help- 
ing students—elementary grades 
through high school—understand 
that science is a process and not 
just a collection of facts. Decatur 
believes strongly in the ability of 
museum collections and exhibi- 
tions to provide a tangible connec- 
tion to textbooks. This relationship 
“can give confidence in what sci- 
ence has to tell us about the world,” 
he said. “You can’t really develop that confidence unless 
there’s an understanding of what the process is like. The 
museum offers a great opportunity for that, partially 
because there are objects that tend to capture excite- 
ment and wonder.’ He admitted, “I still walk through 
the insectarium and look at the range of beetles that are 
in the collection and it just inspires a sense of wonder 
about the natural world.” 

Museums must take on the important work of self- 
auditing and evaluating their conceptualization of what 
is collected and how, the stories their exhibits tell, the 
perspectives and processes involved, and the larger 
research and educational roles that they serve. Sean 
Decatur knows that these overhauls start with decision- 
makers like him. 

-H. Holden Thorp 


H. Holden Thorp 
Editor-in-Chief, 
Science journals. 
hthorp@aaas.org: 
@hholdenthorp 
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New Al techniques could help better predict cyclones such as 2018’s Typhoon Yutu, which intensified rapidly before hitting Tinian and Saipan. 


METEOROLOGY 


Artificial intelligence forecasts the weather in a flash 


model using artificial intelligence (AI) forecasts 
global weather as accurately, and more than 
10,000 times faster, than the best system in use, 
researchers report. The conventional tool, run by 
the European Centre for Medium-Range Weather 
Forecasts, is computationally intensive, requir- 
ing hours of supercomputer calculations to produce 
a 10-day forecast. The new AlI-based model—named 
Pangu-Weather and developed by Huawei, the Chinese 


tech giant—improves on previous Al-powered models 
by simulating weather at different altitudes and fore- 
casting tropical cyclones, with results reliable out to 
10 days, researchers say. The research team trained the - 
model on 39 years of historical weather data; the sys- ‘ 
tem is untested yet using real-time observational data. 
Another AI-based weather model, GraphCast, described | 
by Google DeepMind in a December 2022 preprint, also 
outperformed the European system. 


in gene editing. In a bioRxiv preprint and 
in Nature, the research teams, both of 
which are affiliated with the Massachusetts 
Institute of Technology, also reported that, 
as with CRISPR, they could program RNA 
guides coupled to these enzymes to shuttle 
them to specific locations in human cells to 
precisely edit genes. The researchers specu- 
late that eukaryotic cells may have gained 
the newly identified editing genes from 
transposable elements—so-called jumping 
genes—they received from bacteria. Their 
function in eukaryotes is unknown. 


Nobelist claims antimale bias 


COMMUNITY | Organizers of a conference 
that brings together Nobel laureates and 
early-career researchers are investigating 
whether one of the prize winners violated 
the meeting’s code of conduct last week 
when he disparaged its focus on scientists’ 
diversity during a session about structural 
biology. At the Lindau Nobel Laureate 
Meeting, Kurt Wiithrich, who was awarded 
a Nobel Prize in Chemistry in 2002, also 
referred to “discrimination against” men 


CRISPR relatives found all over 


GENETICS | The revolutionary inven- 
tion of the gene editor called CRISPR 
converted an immune mechanism used 
by single-celled organisms into one of the 
mostly widely used tools in biology. Now, 
two groups have discovered that many 
multicelled organisms, including fungi, 
plants, and animals, carry genes for a 
DNA-cleaving enzyme similar to one that 
prokaryotes use in CRISPR—a discovery 
that may lead to powerful improvements 
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researchers. That comment drew pointed 
criticism from an early-career researcher 
in the audience: “There might be indi- 
vidual discrimination towards men, but it 
is nothing compared to the systematic and 
structural discrimination that women have 
to face, especially in the STEM fields,” 

she said, eliciting applause from some in 
the audience. (She did not identify herself.) 
The code of conduct bars comments that 
“reinforce social structures of domination 
related to ... gender.” To date, 60 women 
have been awarded Nobel Prizes, com- 
pared with 892 men. Just five of 39 invited 
speakers at this year’s Lindau meeting 
were women. 


Ancient DNA expert finds new job 


WORKPLACE | Alan Cooper, a noted 
specialist in ancient DNA dismissed by 
the University of Adelaide in 2019 for 
“serious misconduct,” has been appointed 
a professor at Australia’s Charles Sturt 
University (CSU). Cooper, who once ran 
Adelaide’s Australian Centre for Ancient 
DNA, faced allegations of bullying mem- 
bers of his institute, particularly students. 
Cooper will be affiliated with CSU’s 
Gulbali Institute in Albury-Wodonga. In 

a statement, CSU confirmed Cooper’s 
employment, first reported by Nature, 
but declined to give details, citing privacy 
obligations. The statement said all aca- 
demic staff “are subject to probation and 
supervisory requirements” and that the 
university “is firmly committed to ensur- 
ing a positive workplace culture.” Cooper 
did not respond to a request for comment 
before Science went to press. 


Green hydrogen method advances 


MATERIALS SCIENCE | U.S. and European 
researchers last week reported making big 
improvements to a green way for efficiently 
making hydrogen, a carbon-free fuel. The 
team varied the recipe for solar cells that can 
power catalysts to split water molecules into 
oxygen and hydrogen. Researchers previ- 
ously made the solar cells from materials 
including so-called III-V semiconductors, 
which are highly efficient. But these devices 
corrode quickly in water, rendering them 
impractical. Now, the team has replaced the 
III-Vs with devices made from layers 

of silicon and perovskite, a cheaper mate- 
rial that blends organic and inorganic 
compounds, and protected both with a 
corrosion-resistant coating. The devices can 
convert 20% of solar energy into hydrogen 
gas and survive for more than 100 hours 

of operation, beating the III-Vs on both mea- 
sures, the researchers reported in Nature 
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An Awa woman holds 
hunting bows and 
arrows in Brazil’s Caru 
Indigenous Territory 
in 2017. 


Worldwide survey challenges idea of “Man the Hunter’ 


omen have hunted in nearly 80% of forager societies surveyed in a first-of-its- 
kind global review that contradicts the once-prevailing view that men exclusively 
hunt while women gather. In the 20th century, influential anthropologists cham- . 
pioned the notion that hunting and meat consumption drove humanity's key 
evolutionary changes, including big brains, with men relying on their strength 
to hunt while women stayed near camp, gathering plants and tending children. But 
the survey data refute this. A research team examined ethnographic reports of 63 forager : 
groups spanning the late 1800s to 2010s and found that women hunted in 50 of 
them. The study, published last week in PLOS ONE, found both men and women took 
a variety of prey, although they sometimes practiced different hunting methods. 


Communications. But they need even longer 
lifetimes to become practical. 


Do averages hamper innovation? 


PEER REVIEW | Funding agencies might 
finance a larger number of risky research 
ideas if they ditched the common practice 

of averaging the quality scores provided by 
reviewers of grant applications, a study says. 
It found that biomedical scientists par- 
ticipating in a mock review were willing to 
recommend funding proposals with slightly 
lower average scores if they drew high 

scores from some reviewers. “It appears that 
our experts valued healthy disagreement” 
among reviewers over a proposal’s value, the 
research team reports in a National Bureau of 
Economic Research working paper published 
in June. They asked 563 scientists funded by 
the U.S. National Institutes of Health, which 
typically uses averaged scores to rank propos- 
als, to make funding decisions based on the 
scores, title, and abstract of grant proposals. 
Although the participants chose some stud- 
ies with high average scores, they also liked 
lower-scored proposals with high variation 


in individual reviewer scores—an approach 
that changed the ranking of 58% of propos- 
als compared to a ranking system based 
only on average scores. 


Giant rockslides cut Himalayas 


GEOLOGY | Nearly 1000 years ago, a 

giant rockslide in central Nepal collapsed 
the summit of a mountain likely taller 

than 8000 meters—a phenomenon that 
may have occurred on other peaks across 
the Himalayas, taming the world’s high- 

est mountain range, researchers report. 

The peak probably lost several hundred 
meters of elevation and 23 cubic kilometers 
of rock—a volume equivalent to that of 
22,000 Empire State Buildings—making it 
the Himalayas’s largest known rockslide, 
according to the study this week in Nature. 
High-altitude permafrost may have helped 
the range’s tall peaks resist erosion by 
weather until the underlying rock gave way 
in a massive failure, perhaps triggered by an 
earthquake or local warming. The collapses, 
the authors conclude, are the ultimate con- 
trol on the Himalayas’s towering heights. 
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The Supreme Court ruling 
brought out critics like 

two Harvard students (right) 
as well as supporters. 


After race ban, other routes to diversity beckon 


Equity advocates see paths to increasing minority representation—but say it won't be easy * 


By Jeffrey Mervis 


ast week’s decision by the U.S. Su- 

preme Court to ban the use of race in 

undergraduate admissions was a seri- 

ous setback, say advocates of increas- 

ing minority participation in science 

and engineering. But they are hoping 
that one paragraph in the court’s ruling will 
accelerate adoption of alternative practices 
meant to accomplish the same goal: an edu- 
cation system that better meets the needs of 
the country’s diverse population. 

“Tm not sure it’s 100% bad,” says math- 
ematician Richard Tapia, a winner of the 
National Medal of Science for both his re- 
search and his work mentoring students 
from groups underrepresented in science. 
“Ivll force us to change our approach to 
admissions. And I feel pretty confident that 
we can meet the challenge.” 

Jamie Lewis Keith, an attorney with 
consulting firm EducationCounsel, is tell- 
ing her university clients not to panic. “Do 
not retreat from your equity mission,” says 
Keith, who also works with SEA Change, 
an AAAS initiative to improve campus di- 
versity (AAAS publishes Science). “Reach 
out to students of color—reassure them 
that they are welcome, belong, and will be 
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supported at your institution.” 

In overlapping 6-2 and 6-3 rulings on 29 
June, the high court said the race-conscious 
undergraduate admissions practices of Har- 
vard University and the University of North 
Carolina, respectively, violated the U.S. Con- 
stitution, specifically the equal protection 
clause of the 14th Amendment. “Universities 
wishing to discriminate based on race must 
justify a compelling state interest,’ wrote 
Justice Clarence Thomas in a concurring 
opinion on the case, brought by the advocacy 
group Students for Fair Admissions. “I highly 
doubt any will be able to do so.” 

In a _ sharp dissent, Justice Sonia 
Sotomayor said the decision was a step in 
the wrong direction. “It rolls back decades 
of precedent and momentous progress,” she 
wrote. “Ignoring race will not equalize a so- 
ciety that is racially unequal,” she added. 

The court’s decision effectively ends 6 de- 
cades of affirmative action in admissions by 
colleges and universities. Modified over time 
by numerous legal challenges, the use of af- 
firmative action in its current form draws on 
extensive research showing that increasing 
campus diversity improves the quality of ed- 
ucation that students receive. But its status 
was always precarious, and the court’s con- 
servative majority had repeatedly signaled 


its intentions to eliminate race as a criterion 
in admissions. “We always knew this day 
would come,” Tapia says. 

In practice, the ruling only applies to 
those 200 or so most selective U.S. univer- 
sities that receive applications from many 
more students than they can admit. But 
those are the same institutions that train 
most academic researchers, whose role in- 
cludes teaching the next generation of those 
working in science, technology, engineer- 
ing, and math (STEM) fields. 


“Graduate STEM programs recruit from . 


the most selective universities,” says Stella 
Flores, who studies education leadership and 
public policy at the University of Texas at Aus- 
tin. “So they will take the biggest hit if those 
universities back away” from a commitment 
to diversity in undergraduate admissions. 
Advocates for broadening participation 
in STEM say they are encouraged by one 
brief passage in the majority opinion writ- 
ten by Chief Justice John Roberts. It says a 
student’s race may be relevant in the admis- 
sions process if an applicant discusses how it 
shaped them as an individual. Roberts offers 
the example of a student essay that discusses 
“overcoming racial discrimination” to dem- 
onstrate “courage and determination” or how 
an applicant’s “heritage or culture” prompted 
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them “to assume a leadership role.” 

Keith says those experiences, showing 
resilience or what has been called grit, can 
be taken into account so long as the dem- 
onstrated character qualities contribute to 
the educational mission of the institution. 
That’s especially important in STEM fields, 
she says, where many minority students 
haven’t had the same opportunities as their 
white peers to take the needed prerequisite 
courses and begin to self-identify as a po- 
tential scientist. 

Tapia, who describes his career path as 
going “from the barrios of Los Angeles to 
the White House,” sees the ruling as an invi- 
tation for research universities to go beyond 
the traditional reliance on grades and stan- 
dardized test scores in assessing a student’s 
potential. “We need to make sure that the 
person that we accept has a high probability 
of succeeding,” he says. The need to assess 
students’ experiences as well as their grades 
and scores has implications for the admis- 
sions committees themselves, he says. “To 
do that, you need to have the appropriate 
background. I understand what that takes 
because I’ve been there.” 

Ruth Simmons, a former president of 
Brown University who just retired from the 
presidency of Prairie View A&M University, 
a historically Black institution, agrees. “We 
use proxies, like test scores or where a per- 
son was educated, instead of looking at the 
whole person,” she says. “And this decision, 
at its best, may get us out of that lazy way 
of assessing people. So I’m excited about 
the possibilities.” 

Tapia emphasizes that admissions aren’t 
the only factor in diversity. Retaining those 
students once they are admitted is equally 
important. 

“You don’t wait for a minority student 
to come and ask you for help. That’s too 
late,’ he says. “They don’t want to admit 
that they’re not doing so well. So I reach 
out to those who I find out are struggling.” 

University administrators say they are 
still trying to digest the ruling and look- 
ing for guidance. They may get some soon 
from the White House, which announced 
it would convene a national summit on 
“expanding access to educational oppor- 
tunity.” The gathering will be followed in 
September by a report from the Depart- 
ment of Education on how to use “mea- 
sures of adversity ... to build inclusive, 
diverse student bodies.” 

Applying those measures will require what 
Flores calls authentic recruiting, in which 
faculty and senior administrators are trained 
to recognize the barriers that students have 
faced in seeking a quality education. “It’s go- 
ing to take real courage and conviction,” she 
adds. “And a lot more money.” 
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Winning a Nobel Prize may 
be bad for your productivity 


Fame and time commitments that come 
with the honor distract from continued research 


By David Shultz 


or many scientists, there’s no greater 

achievement than winning a Nobel 

Prize. The award recognizes break- 

throughs that have broadened our un- 

derstanding of reality and changed the 

world. It also might be a productivity 
killer. According to a working paper recently 
published by the National Bureau of Eco- 
nomic Research, scientists’ output tends to 
fall off a cliff after they've won the prize. 

Researchers behind the study analyzed 
data on Nobel Prize winners in physiology 
and medicine between 1950 and 2010, chart- 
ing how three factors changed after their 
awards: the number of papers 
they published; the impact of 
those papers, based on how 
often they were cited; and how 
novel their ideas were. The 
authors quantified novelty by 
using a computer program to 
identify distinct scientific ideas 
in the papers, then assign- 
ing each idea an age based on 
when it first appeared in the 
US. National Library of Medi- 
cine’s Unified Medical Lan- 
guage System. Papers with “younger” ideas 
relative to their publication dates were given 
higher novelty scores. 

Because the prize often goes to scientists 
late in their careers, when output typically 
declines, the study compared Nobelists with 
age-matched winners of another prestigious 
medical prize, the Lasker Award. “The match- 
ing strategy is sound,” says Kirk Doran, a so- 
cial scientist at the University of Notre Dame. 
“Overall, the results are quite convincing.” 

Before winning the prize, future Nobel lau- 
reates published more frequently than their 
colleagues who eventually won the Lasker 
Award, and they also published more-novel 
papers and garnered more citations. After 
winning a Nobel, however, the trend flipped: 
Nobel winners, on average, saw declines in 
productivity, novelty, and citations, falling 
even with the Lasker winners or below them. 
Future Nobel winners published about one 
study more per year than future Lasker win- 
ners in the 10 years leading up to the prize. 


“They're not 
just scientists 
anymore, 
but public 
intellectuals.” 


Jayanta Bhattacharya, 
Stanford University 


In the 10 years after winning, however, the 
Lasker group published about one more 
study annually than did their Nobel laureate 
peers. The switch was driven almost entirely 
by declines in the Nobel group. 

“I was actually surprised by the shift in 
novelty,’ says the working paper’s first author, 
Jayanta Bhattacharya, a health economist 
and epidemiologist at Stanford University. “I . 
expected the shifts in productivity because 
of the [new] demands on their time, but the 
shift towards less novel work is a little bit of 
a surprise to me.” 

Doran, who has studied the career effects 
of winning a Fields Medal—the Nobel’s equiv- ‘ 
alent in math—says the results gel with his 
own findings, which show that 
mathematicians also appear to 
suffer decreases in productivity 
following a win. . 

Although Bhattacharya and 
colleagues stress that their anal- 
ysis does not show a causal link 
between winning the Nobel and 
a drop in productivity, Doran 
notes that winning such a pres- 
tigious award is life-changing. 
Laureates may be overwhelmed 
with speaking engagements, 
media interviews, or book deals, all of which 
eat into the time and energy available to do ‘ 
original science. The Lasker, although highly 
prestigious, does not carry the same house- 
hold recognition. : 

“There’s just a big change in the patterns . 
that you see after the Nobel Prize, that in- 
dicates a pretty fundamental shift in their 
lives,” Bhattacharya says. “They're not just 
scientists anymore, but public intellectuals.” 

Given that, do the prize’s benefits outweigh 
the potential for lost research? Bhattacharya 
doesn’t want to scrap the Nobel Prize, but 
suggests saving the recognition for scientists 
later in their career, to prevent interrupt- 
ing our most brilliant minds in their prime. 
“Awarding Nobels as career achievement 
awards would still achieve a lot of the ben- 
efits of the Nobel while eliminating some of 
the harms,” he says. 


& 


David Shultz is a journalist in Santa Barbara, 
California. 
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them “to assume a leadership role.” 

Keith says those experiences, showing 
resilience or what has been called grit, can 
be taken into account so long as the dem- 
onstrated character qualities contribute to 
the educational mission of the institution. 
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tential scientist. 

Tapia, who describes his career path as 
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the traditional reliance on grades and stan- 
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potential. “We need to make sure that the 
person that we accept has a high probability 
of succeeding,” he says. The need to assess 
students’ experiences as well as their grades 
and scores has implications for the admis- 
sions committees themselves, he says. “To 
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background. I understand what that takes 
because I’ve been there.” 
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presidency of Prairie View A&M University, 
a historically Black institution, agrees. “We 
use proxies, like test scores or where a per- 
son was educated, instead of looking at the 
whole person,” she says. “And this decision, 
at its best, may get us out of that lazy way 
of assessing people. So I’m excited about 
the possibilities.” 

Tapia emphasizes that admissions aren’t 
the only factor in diversity. Retaining those 
students once they are admitted is equally 
important. 

“You don’t wait for a minority student 
to come and ask you for help. That’s too 
late,’ he says. “They don’t want to admit 
that they’re not doing so well. So I reach 
out to those who I find out are struggling.” 

University administrators say they are 
still trying to digest the ruling and look- 
ing for guidance. They may get some soon 
from the White House, which announced 
it would convene a national summit on 
“expanding access to educational oppor- 
tunity.” The gathering will be followed in 
September by a report from the Depart- 
ment of Education on how to use “mea- 
sures of adversity ... to build inclusive, 
diverse student bodies.” 

Applying those measures will require what 
Flores calls authentic recruiting, in which 
faculty and senior administrators are trained 
to recognize the barriers that students have 
faced in seeking a quality education. “It’s go- 
ing to take real courage and conviction,” she 
adds. “And a lot more money.” 
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even with the Lasker winners or below them. 
Future Nobel winners published about one 
study more per year than future Lasker win- 
ners in the 10 years leading up to the prize. 


“They're not 
just scientists 
anymore, 
but public 
intellectuals.” 


Jayanta Bhattacharya, 
Stanford University 
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by declines in the Nobel group. 

“I was actually surprised by the shift in 
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Jayanta Bhattacharya, a health economist 
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of the [new] demands on their time, but the 
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mathematicians also appear to 
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that you see after the Nobel Prize, that in- 
dicates a pretty fundamental shift in their 
lives,” Bhattacharya says. “They're not just 
scientists anymore, but public intellectuals.” 

Given that, do the prize’s benefits outweigh 
the potential for lost research? Bhattacharya 
doesn’t want to scrap the Nobel Prize, but 
suggests saving the recognition for scientists 
later in their career, to prevent interrupt- 
ing our most brilliant minds in their prime. 
“Awarding Nobels as career achievement 
awards would still achieve a lot of the ben- 
efits of the Nobel while eliminating some of 
the harms,” he says. 


David Shultz is a journalist in Santa Barbara, 
California. 
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PALEONTOLOGY 


Rare fossil implies deep roots for vertebrates 


Half-billion-year-old tunicate from western Utah “looks like it died yesterday” 


By Amanda Heidt 


n 2019, a finger-size fossil landed on the 
desk of Karma Nanglu, a paleontologist 
at Harvard University who specializes 
in the Cambrian and Ordovician peri- 
ods, when many of today’s animal forms 
made their entrance. The specimen had 
sat for years in the drawer of a Salt Lake City 
museum; its finders, who had pulled it from 
a fossil-rich layer of Cambrian limestone 
in western Utah, thought it might be a sea 
squirt or tunicate—a marine invertebrate 
that shares a distant ancestor with all ver- 


wasn’t involved in the work. The discovery 
offers clues to the timing and development 
of early tunicates and could even push back 
the date for the origin of tunicates’ sister 
group, the vertebrates, including humans. 
Today, some 3000 species of tunicate live 
in almost every habitat of the oceans. Most 
have a two-part life cycle, including a free- 
swimming, tadpolelike larva that settles and 
metamorphoses into a stationary adult. Tu- 
nicate larvae have a notochord, the precur- 
sor to a spinal column—a defining trait of 
the group called chordates, which includes 
all vertebrates. But only a handful of tunicate 


An ancient tunicate (fossil, left; reconstruction, right) had a saclike body and twin siphons, much like tunicates today. 


tebrates. Nanglu was excited but cautious 
at the prospect of a very ancient tunicate: 
“That’s a group for which there is essentially 
no fossil record for the entire 500 million 
years of recorded history.” 

Now, in a paper published in Nature Com- 
munications this week, Nanglu and his co- 
authors report that the exquisitely-preserved 
500-million-year-old fossil is a dead ringer 
for some tunicates today, with two siphons 
to filter organic particles from the water 
and complex musculature controlling the 
siphons. “It looks like a tunicate that died 
yesterday and just happened to fall down on 
some rock,” says Nicholas Treen, a develop- 
mental biologist at Princeton University who 
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fossils exist, for reasons paleontologists can’t 
fully explain. The best-known, 520-million- 
year-old Shankouclava anningense, has 
proven controversial: It’s tunicate-ish in that 
it has a siphon and seemed to filter feed, but 
it doesn’t resemble anything alive today. 

The new specimen, however, with details 
of its soft body preserved, is unambiguously 
a tunicate. “It’s really kind of glorious, the 
picture of this fossil?’ Treen says. Nanglu 
and colleagues named it Megasiphon thy- 
lakos for its large siphons and saclike body 
(thylakos is from the Greek word for pouch). 
Modern tunicates use these siphons to help 
filter-feed and have bands of longitudinal 
muscles running down their bodies as well 


as circular muscles around the siphons to 
control them. The details of Megasiphon’s 
musculature look almost identical to those 
of Ciona intestinalis, a well-studied modern 
tunicate, Nanglu says. 

As a tunicate matures, its siphon muscles 
develop from the same cells that form car- 
diac tissue in modern vertebrates. The mus- 
cles’ presence in Megasiphon means it may 
have already had something like a heart, 
even though its internal structures aren’t 
preserved. “Since you can see these atrial 
siphon muscles, you can almost take it for 
granted that there is a beating, vertebrate- 
like heart inside this organism,” Treen says. 

The fossil addresses a long-unresolved 
question in early chordate evolution: 
whether the common ancestor to all tu- 
nicates was a free-swimming organism or 
rooted to the bottom. Megasiphon, with 
its resemblance to living, sessile tunicates, 
strongly supports the latter hypothesis. 
The find suggests that tunicates’ two-part 
life history and ability to metamorphose is 
an ancestral characteristic of the group. 

The existence of such a complete and rec- 
ognizable tunicate at this time could also 
push back the origin of vertebrates, now 
thought to be at about 450 million years 
ago. “If a tunicate body plan was already 
established [at 500 million years ago], and 
the tunicates are actually the sister group of 
vertebrates, then vertebrates may be older 
as well,” says William Jeffery, a developmen- 
tal biologist at the University of Maryland, 
College Park, who was not involved in the 
work. Other fossils offer some support to 
this idea: Megasiphon likely lived along- 
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side a swimming, fishlike animal named . 


Metaspriggina, dated to as old as 505 million 
years ago. Metaspriggina had eyes and 
blocks of muscle, and probably a notochord. 
So, by about 500 million years ago, both it 
and Megasiphon already have many fea- 
tures considered as vertebrate hallmarks, 
suggesting an early origin for the group. 
Still, Nanglu says existing fossils aren’t 
enough to change the story of vertebrate 
evolution. “Humans are naturally pulled to 
this question of origins, and this helps de- 
velop that story, but we need more fossil 
evidence,” he says. “But ... answering these 
longstanding questions, that’s totally why 
were in it to begin with.” & 


Amanda Heidt is a science journalist in Moab, Utah. 
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EDUCATION 


Israel’s science-free schools for 
ultra-Orthodox Jews draw fire 


Researchers criticize approval of new funding for religious 
schools without requirements for broader curriculum 


By Michele Chabin, in Jerusalem 


he Israeli parliament’s recent decision 

to give a hefty budget increase to reli- 

gious schools that often do not teach 

science and math is drawing criticism 

from the nation’s research community. 

The move threatens to leave a grow- 
ing share of Israel’s young people without 
the knowledge and skills needed to thrive in 
an increasingly competitive global economy, 
they warn. 

“Hundreds of thousands of kids are get- 
ting an education for life in the Middle Ages,” 
says biochemist and Nobel laureate Aaron 
Ciechanover of the Technion Israel Institute 
of Technology. “I won't say that what they're 
learning is useless, but it won’t make them 
part of the Israeli economy or prepare them 
to study at university.” 

At issue are hundreds of elementary and 
secondary schools that serve Israel’s ultra- 
Orthodox, or Haredi, Jewish community. 
Haredim comprise roughly 13% of Israel’s 
population, but Haredi schools enroll nearly 
26% of Jewish students and 20% of all Israeli 
students. The schools, which receive public 
funding but often are not subject to govern- 
ment regulation, enrolled about 373,000 stu- 
dents during the 2021-22 school year. 

Haredi schools typically prepare boys for 
lifelong study of Jewish religious texts and 
law, rather than employment. By seventh 
grade, most of the curriculum focuses on re- 
ligious content. In 2020, for example, 84% of 
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boys in Haredi high schools studied no secu- 
lar subjects, according to political scientist 
Gilad Malach of the Israel Democracy Insti- 
tute. Haredi girls, who are expected to work 
and support their families once they marty, 
are taught a state-mandated core curriculum 
that includes secular subjects, but the lessons 
are often less advanced than those given to 
non-Haredi students. 

The quality of Haredi education has long 
been a sensitive issue in Israel. Supporters 
of Haredi schools say they are intellectually 
rigorous and help meet a sacred obligation 
to strengthen the Jewish character of the 
world’s only Jewish-majority country. But 
critics argue they contribute to high poverty 
and unemployment rates in the Haredi com- 
munity, which demographers forecast could 
constitute 25% of Israel’s population by 2050. 
Roughly half of Haredi men are jobless. 

Earlier this year, government funding for 
Haredi schools became a political flash point 
after Prime Minister Benjamin Netanyahu’s 
governing coalition, which draws critical 
support from conservative religious and na- 
tionalist parties, proposed a $333 million 
spending boost for the schools in its 2023-24 
budget. In the past, Netanyahu supported 
linking public funding for Haredi schools to 
their willingness to teach secular subjects, in- 
cluding science, math, and English. But the 
budget proposal included no such conditions 
because of opposition from key members of 
Netanyahu’s fragile coalition. 

As parliament prepared to vote on the bud- 
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Religious texts are a focus of instruction in Israel upde 
schools for the ultra-Orthodox, especially for boys. 


get in May, some academic researchers urged 
it to reject the no-strings-attached funding. 
“Research clearly shows that children who 
do not study a full core curriculum have con- 
siderable difficulty in overcoming this dis- 
advantage as adults,” stated one open letter 
signed by nearly 300 economists. “The aver- 
age proficiency of Israeli students in math, 
science and reading already places them 
below the average score in every other de- 
veloped country,’ they noted, arguing that 
Netanyahu’s government was pursuing poli- 
cies “on the basis of short-term political con- 
siderations” that “are putting Israel on the 
path to becoming a third world economy.” 
Such warnings, however, failed to prevent 
lawmakers from approving the budget on 
24 May, with Netanyahu winning support . 
from all 64 members of his coalition. 

If Netanyahu’s uneasy governing alli- 
ance ultimately fractures, a new govern- 
ment could revisit the spending plan. In the 
meantime, some science and technology ad- 
vocates are working to strengthen programs 
that provide technical training to gradu- 
ates of Haredi schools. The Israel Democ- 
racy Institute says more than 5000 Haredi 
students were enrolled in rabbi-approved, 
gender-segregated, post-high school tech- 
nological training programs during the 
2021-22 school year, with an additional 
15,600 students pursuing a college degree. 

Although graduates of MHaredi high 
schools have a weak foundation in math, 
science, and English, their rigorous reli- 
gious education, which often involves ex- 
tensive memorization and analysis, “has 
taught them how to learn,” says educational 
neuroscientist Orit Elgavi-Hershler of the 
Mofet Institute, a teacher training consor- 
tium. “And they know how to think,” she 
says, which can enable them to close edu- 
cational gaps. : 

“On the ground, I see that the young gen- . 

eration of Haredi men and women want to 
be part of the future of the economy, and 
there are a lot of companies open to recruit- 
ing more Haredim,” says Moshe Friedman, 
a Haredi entrepreneur who co-founded 
KamatTech, a training program that collab- 
orates with some of Israel’s largest technol- 
ogy firms. “I’m very optimistic.” 

Demographer Sergio DellaPergola, a 
professor emeritus at the Hebrew Uni- 
versity of Jerusalem, sees promise in such 
initiatives. “If the Haredim get a general 
education, they’ll be a highly productive 
addition to Israeli society,’ he says. “Every- 
one will benefit.” 
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Michele Chabin is a journalist in Israel. 
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COVID-19 


Studies probe COVID-19 shots’ 
link to rare symptoms 


Details emerge for uncommon cases of neurologic 
complications, blood pressure swings, and other side effects 


By Gretchen Vogel and 
Jennifer Couzin-Frankel 


OVID-19 vaccines have saved mil- 

lions of lives, and the world is 

gearing up for a new round of 

boosters. But like all vaccines, 

those targeting the corona- 

virus can cause side effects in some 
people, including rare cases of abnormal 
blood clotting and heart inflammation. 
Another apparent complication, a debili- 
tating suite of symptoms that resembles 
Long Covid, has been more elusive, its link 
to vaccination unclear and its diagnostic 
features ill-defined. But in recent months, 
what some call Long Vax has gained wider 
acceptance among doctors and scientists, 
and some are now working to better un- 
derstand and treat its symptoms. 

“You see one or two patients and you won- 
der if it’s a coincidence,” says Anne Louise 
Oaklander, a neurologist and researcher at 
Harvard Medical School. “But by the time 
you've seen 10, 20,” she continues, trailing 
off. “Where there’s smoke, there’s fire.” 

Cases seem very rare—far less common 
than Long Covid after infection. Symptoms 
can include persistent headaches, severe fa- 
tigue, and abnormal heart rate and blood 
pressure. They appear hours, days, or weeks 
after vaccination and are difficult to study. 
But researchers and clinicians are increas- 
ingly finding some alignment with known 
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medical conditions. One is small fiber 
neuropathy, a condition Oaklander studies, 
in which nerve damage can cause tingling 
or electric shock-like sensations, burning 
pain, and blood circulation problems. The 
second is a more nebulous syndrome, with 
symptoms sometimes triggered by small fi- 
ber neuropathy, called postural orthostatic 
tachycardia syndrome (POTS). It can in- 
volve muscle weakness, swings in heart rate 
and blood pressure, fatigue, and brain fog. 

Patients with postvaccination symptoms 
may have features of one or both condi- 
tions, even if they don’t meet the criteria 
for a diagnosis. Both are also common in 
patients with Long Covid, where they’re of- 
ten attributed to an immune overreaction. 

Although more researchers are now tak- 
ing Long Vax seriously, regulators in the 
United States and Europe say they have 
looked for, but have not found, a connec- 
tion between COVID-19 vaccines and small 
fiber neuropathy or POTS. “We can’t rule 
out rare cases,” says Peter Marks, director 
of the U.S. Food and Drug Administra- 
tion’s Center for Biologics Evaluation and 
Research, which oversees vaccines. “If a 
provider has somebody in front of them, 
they may want to take seriously the con- 
cept [of] a vaccine side effect,” he says. But 
Marks also worries about “the sensational 
headline” that could mislead the public, 
and he emphasizes that vaccine benefits 
far outweigh any risks. 
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Emergency medicine doctor Saleena Subaiya (left) upd: 


developed severe postvaccine symptoms, with thei: 
partner Lawrence Purpura, an infectious disease doctor. 


Despite the uncertainties, German Minis- 
ter of Health Karl Lauterbach acknowledged 
in March that though rare, Long Covid-like 
symptoms after vaccination are a real phe- 
nomenon. He said his ministry was working 
to organize funding for studies, although 
none has been announced so far. 

Researchers studying these complica- 
tions also worry about undermining trust 
in COVID-19 vaccines. Harlan Krumholz, a 
cardiologist at Yale University, says concern 
that the antivaccine movement would seize 
on any research findings made him hesitant 
at first to dive in. But about a year ago he 
and Yale immunologist Akiko Iwasaki be- 
gan to welcome postvaccination patients 
into a new study called LISTEN that also . 
includes Long Covid patients. Among other 
things, it aims to correlate symptoms with 
immune cell patterns in blood samples. 

“Tm persuaded that there’s some- 
thing going on” with these side effects, 
Krumholz says. “It’s my obligation, if I truly 
am a scientist, to have an open mind and 
learn if there’s something that can be done.” 


SCIENCE FIRST wrote about these health 
concerns in January 2022, describing ef- 
forts by scientists at the National Insti- 
tutes of Health to study and treat affected 
individuals (28 January 2022, p. 364). A 
study including 23 people was posted as a 
preprint in May 2022 but never published. 
Following Science’s story, almost 200 people 
contacted the journal sharing their post- 
vaccination symptoms. 

Research has since proceeded slowly. 
This is “a challenging outcome to moni- 
tor,” Tom Shimabukuro of the U.S. Centers 
for Disease Control and Prevention told a 
government advisory committee in January. 
Still, more than two dozen case studies have ° 
accumulated describing POTS or small fiber . 
neuropathy following a COVID-19 shot, re- 
gardless of the vaccine manufacturer. 

In 2021, Sujana Reddy, now an inter- 
nal medicine resident physician at East 
Alabama Health, published a case study of 
postvaccine POTS in a 42-year-old man. She 
then heard from more than 250 people de- 
scribing similar health concerns. Reddy be- 
gan a study documenting POTS cases, not 
yet published, which now includes 55 peo- 
ple who developed symptoms 1 to 2 weeks 
after vaccination. 

Another team scanned for postvaccine 
POTS across a specific population. Re- 
searchers led by cardiologists Alan Kwan 
and Susan Cheng at Cedars-Sinai Medi- 
cal Center analyzed a health database 
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of almost 285,000 people in the Los An- 
geles area; all had received at least one 
COVID-19 shot. They found that within 
90 days after a shot, the rate of POTS- 
related symptoms was about 33% higher 
than in the 3 months before; 2581 people 
were diagnosed with POTS-related symp- 
toms after vaccination, compared with 
1945 beforehand. However, the study 
found a bigger effect from COVID-19 it- 
self: The rate of POTS symptoms in about 
12,000 unvaccinated people after infection 
was 52% higher than beforehand. Although 
Kwan cautions against extrapolating these 
numbers to a wider population, he says the 
pattern is intriguing. “Our data show a rel- 
atively clear signal that there probably is 
an increase in POTS after vaccination and 
after infection,” he says. 

Others find the study compelling. “Even 
last year I was a little bit cautious” about 
the link between POTS and vaccination, says 
Tae Chung, a neuromuscular physiat- 
rist who runs the POTS clinic at Johns 
Hopkins University. “I didn’t have quan- 
titative data to back it up, but now I 
feel like I do.” Still, Chung stresses that 
this paper and other data also suggest 
COVID-19 vaccines protect against 
POTS and other Long Covid symptoms, 
and he remains a strong advocate for 
vaccination. 


AN IMMUNE OVERREACTION to SARS- 
CoV-2 spike protein, which COVID-19 
vaccines use to induce protective anti- 
bodies, is one possible cause of these 
symptoms. One theory is that after 
vaccination some people generate an- 
other round of antibodies targeting 
the first. Those antibodies could func- 
tion somewhat like spike itself: Spike 
targets a cell surface protein called the 
angiotensin-converting enzyme 2 (ACE2) 
receptor, enabling the virus to enter cells. 
The rogue antibodies might also bind to 
ACE2, which helps regulate blood pressure 
and heart rate, says Bernhard Schieffer, a 
cardiologist at the University of Marburg. 
If those antibodies disrupt ACE2 signaling, 
that could cause the racing heart rates and 
blood pressure swings seen in POTS. 

Small fiber neurons also have the ACE2 
receptor on their surface, so in theory rogue 
antibodies could contribute to neuropathy. 
But Matthew Schelke, a neurologist at Co- 
lumbia University who has treated small 
fiber neuropathy in both Long Covid and 
postvaccine patients, says pinning down 
a connection won’t be easy. Even when 
unusual antibodies turn up in someone’s 
blood, “it’s extremely difficult to know if any 
of these are pathogenic or if they are just 
bystanders,” he says. Other immune system 
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components that fuel inflammation may 
also harm nerves, he notes. 

Some people appear susceptible to com- 
plications after both infection and vacci- 
nation, a double vulnerability that can put 
them in an agonizing situation—eschewing 
more vaccine doses, often on the advice of 
their doctors, but also fearing the hazards 
of Long Covid. Reddy falls into this cat- 
egory. She caught the virus in early 2020 
while caring for an infected patient and 
developed Long Covid, including POTS. Her 
symptoms dramatically worsened after her 
first vaccine dose. 

An emergency medicine physician in New 
York City, Saleena Subaiya, experienced 
the same phenomenon in reverse. Within 
24 hours of the second dose of COVID-19 
vaccine in January 2021, they developed 
severe cognitive impairment, fatigue, and 
a loss of balance. Subaiya improved some- 
what over the next year, but was forced to 


Long Vax has become a focus for cardiologist Bernhard Schieffer, 
shown here treating a patient with a different condition. 


switch to a part-time research job. Then in 
December 2021, they were hit by Omicron 
and relapsed. 

Postvaccination illness is “a long, relent- 
less disease,’ says Lawrence Purpura, an 
infectious disease specialist at Columbia 
University who is also Subaiya’s partner— 
and who treats both Long Covid patients 
and those with chronic symptoms after 
vaccination. 


A DIAGNOSIS OF POTS or small fiber neurop- 
athy post-vaccination can guide treatment. 
In POTS, doctors focus on increasing salt 
and fluid intake to boost blood volume and 
maintain blood pressure. Beta blockers, 
which slow racing hearts, may also help. 
Small fiber neuropathy is treated with 
various medications to manage symptoms, 
and for severe cases sometimes intrave- 
nous immunoglobulin (IVIG), an expensive 


and hard-to-access antibody mixture that 

can tamp down immune overreactions. 

Some case studies report that IVIG helped 

people with postvaccine small fiber neuro- 

pathy, at least temporarily. 

A more radical approach is plasma 
exchange, which is sometimes used for 
autoimmune disorders. Here the patient’s 
plasma—the liquid part of the blood con- 
taining antibodies and proteins—is sepa- 
rated from the blood cells and discarded. 
The blood cells are then returned to the 
patient along with a replacement liquid. 

Plasma exchange helped a man who de- 
veloped small fiber neuropathy following 
his second dose of a COVID-19 vaccine, 
Schelke and colleagues reported in Octo- 
ber 2022 in Muscle & Nerve. “He responded 
very well,” with months of improvement, 
Schelke says, but recently returned with 
worsening symptoms. 

Schieffer, meanwhile, has developed an 
experimental treatment regime that 
he says showed promise in an un- 
published study of eight Long Co- 
vid and eight postvaccine patients. 
It includes statins, which can blunt 
inflammation in the circulatory sys- 
tem; and angiotensin II type 1 recep- 
tor blockers, which can help block the 
ACE2 pathway activation that may 
be dysregulated in patients. Schief- 
fer and colleagues are hoping to start 
a clinical trial of the therapies with 
500 people who have either Long Co- 
vid or postvaccine symptoms. 

Patient advocates are trying to spur 
new studies. In the next few months, 
the nonprofit REACTI9 plans to dis- 
tribute small grants, in the tens of 
thousands of dollars, to teams study- 
ing immunology, biomarkers, and 
other features of postvaccine illness. 

Even modest support matters, Krumholz 
says, because “it’s incumbent on us to pro- 
duce preliminary data” to win over funders 
with deep pockets. His and _ Iwasaki’s 
LISTEN project now has about 2000 par- 
ticipants, roughly 1000 with Long Covid 
and 750 with postvaccination symptoms, 
he says. (LISTEN also includes healthy 
controls.) He hopes to begin to release re- 
sults later this summer. 

As countries draft plans for a round of 
updated vaccines, some scientists theorize, 
reassuringly, that people whose immune sys- 
tems accepted a previous shot without inci- 
dent would be highly unlikely to experience 
immune dysfunction following a booster. But 
that doesn’t change the urgent need to help 
those suffering now, Reddy says. “We need to 
figure out why it’s happening to this subset 
of people. Why is it happening to them and 
not everyone else?” 
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THE STAR WATCHER 


Amateur astronomer Koichi Itagaki is one 
of the most prolific supernova hunters of all time 


By Dennis Normile 


or Koichi Itagaki, it was just another 
evening of supernova hunting. Af- 
ter his usual 7 p.m. dinner with his 
wife, he drove to his private obser- 
vatory in the hills above his home 
in Yamagata, Japan, 290 kilometers 
north of Tokyo. He set out treats 
for the only visitors he allows on 
his celestial patrols: a stray cat he 
calls Nora and the raccoon dogs that warily 
approach from the surrounding forest. He 
then took a seat within his “headquarters,” 
a cozy hut equipped with a bed, minifridge, 
microwave—and a dozen monitors used to 
control seven telescopes at three locations 
across Japan. 

On clear nights, each telescope runs 
through a routine, focusing on one of 
the approximately 1000 galaxies Itagaki 
monitors for two 15-second exposures be- 
fore swiveling to the next target. On this 
night, clouds blanketed most of the coun- 
try, leaving a clear view only for his two 
telescopes in Okayama, 700 kilometers to 
the southwest. But as the clock ticked into 
the early morning of 20 May, clouds drifted 
over Okayama, too. Itagaki called it a night 
and drove home, leaving the telescopes on 
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automatic in case the weather cleared. 

It did. The next morning he had scanned 
the night’s images for just 5 minutes when 
he spotted a new, bright object in a spi- 
ral arm of the Pinwheel galaxy, 21 mil- 
lion light-years away. “It was so bright, I 
thought there was no way this object could 
have been missed,” he says. To his surprise, 
he was the first to post the news to the 
Transient Name Server (TNS), the Inter- 
national Astronomical Union’s database 
of new celestial objects. As word spread 
on TNS and the Astronomer’s Telegram, 
an email alert service, professionals and 
amateurs alike began pointing their instru- 
ments toward SN 2023ixf, the universe’s 
newest exploding star and the closest to 
Earth in a decade. It was growing brighter 
by the minute. 

For Itagaki, an amateur astronomer and 
semiretired snack food executive, such 
discoveries are routine. SN 2023ixf is his 
172nd supernova, a total topped only by 
U.S.-based Tim Puckett, whose private ob- 
servatory in Georgia has bagged at least 
360 supernovae with the help of a world- 
wide network of volunteers who manually 
examine his images. Itagaki, by contrast, 


works alone. He “is one of the most prolific 
supernova observers in the world,” says 
Andrew Howell, an astronomer at the Uni- 
versity of California (UC), Santa Barbara. 

But his achievement goes beyond num- 
bers, Howell says. “The really special thing 
is that so many of the supernovae he finds 
turn out to be interesting.” Although his 
formal education ended with high school, 
Itagaki has also co-authored a couple 
dozen scientific papers. “He’s self-trained, 
so that’s really incredible,’ says Daichi 
Hiramatsu, an astronomer at the Center 
for Astrophysics | Harvard & Smithsonian. 

Amateurs have a long tradition of con- 
tributing to astronomy. Before the advent 
of large survey telescopes, they discovered 
many of the asteroids and comets that 
swing close to Earth. They have discovered 
galaxies and nebulae too dim and diffuse 
for large telescopes to notice. And they are 
now discovering planets outside the Solar 
System. But even among this storied popu- 
lation of hobbyists, Itagaki stands out “not 
only for his success, but also for his dedica- 
tion,” says Jeremy Shears, former president 
of the British Astronomical Association, 
which supports amateurs. 
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CHEERFUL, FRIENDLY, and easy-going, Itagaki 
is atrim 75-year-old with wisps of white hair, 
wire-rimmed bifocals, and a self-effacing 
sense of humor. “I am not an astronomer,’ 
Itagaki says, smiling broadly while waving 
his left hand dismissively, as if shooing away 
a fly. “I’m looking for new celestial bodies as 
a hobby.’ He traces his path into astronomy 
to a boyhood fascination with lenses. “I used 
to play with lenses, using sunlight to burn 
paper,’ he says. In junior high, he spent his 
allowance on a DIY telescope kit and studied 
the Moon a bit. “I also used it to spy on the 
neighbors,” he says, smiling and waving. 

Then in 1963, a 19-year-old Japanese ama- 
teur named Kaoru Ikeya grabbed national 
headlines when he discovered a comet with 
a more substantial homemade telescope. “It 
amazed me that you could search the stars 
like this,’ Itagaki says. Within Itagaki’s hut, 
a framed 1963 newspaper clipping of Ikeya’s 
achievement hangs in homage. 

After high school, Itagaki went to work 
at a local confectionary company, saving up 
to buy a 15-centimeter telescope with which 
he discovered his first comet at age 20. He 
joined Itagaki Peanuts, a snack food com- 
pany started by his father, and eventually 
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became its CEO. He is credited with being 
the first in Japan to sell snack “minipacks” 
that provide just a mouthful of peanuts, ca- 
shews, or candy-coated almonds. As the busi- 
ness prospered, Itagaki says he “invested a 
lot” in his passion—perhaps as much as the 
cost of typical Japanese house, he estimates. 
At age 60, he turned over operations of the 
30-employee firm to his sons so he could de- 
vote himself to astronomy. 

Like Ikeya and many other amateurs, 
Itagaki started out watching for comets. But 
in 1998, NASA set up a near-Earth object pro- 
gram, which enlisted major observatories to 
detect and track threatening celestial bodies. 
It left little room for amateur discoveries. So 
in 2000, “I gave up on comets and started 
searching for supernovae,” Itagaki says. 

To escape Yamagata’s lights, Itagaki 
rented a plot in the hills above the city, 
and over time added the stainless-steel 
domes that now house his 60-, 50-, and 
ll-centimeter telescopes. About 8 years 
ago he set up the second observatory in 
Okayama, and 5 years ago he added a third 
observatory with another two telescopes in 
Kochi, 800 kilometers away on Shikoku, the 
smallest of Japan’s four main islands. 


In May, the supernova 2023ixf erupted 
(white burst, far left) and became the 
vh brightest object in the Pinwheel galaxy. 


The geographic distribution means at 
least one of the observatories is in the clear 
almost every night. “I don’t think there’s 
anyone else in the world who has a network 
of telescopes operated in such a way,” says 
Hiramatsu, who adds that Itagaki also has 
an advantage in spotting supernovae that 
first appear during Japan’s nighttime be- 
cause Asia lacks a big survey. And whereas 
the big observatories tend to watch distant 
galaxies, Itagaki concentrates on nearby . 
ones, where any supernovae can be stud- 
ied when they are intrinsically fainter, 
says Azalee Bostroem, an astronomer at 
the University of Arizona. “This means 
they are discovered younger and can be 
observed later, sometimes years after the 
explosion,” she says. 


SN 20231XF SHOWS how Itagaki gets the jump 
on other observers. Monika Soraisam, an 
astronomer at the Gemini Observatory, 
had written a program to automatically 
alert her to supernova candidates caught 
by the Zwicky Transient Facility, a survey 
using the 48-inch telescope at the Palomar 
Observatory in California. “I had the alert 
in my inbox the same night that Itagaki 
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Koichi Itagaki’s homespun observatory in Yamagata is one of three he maintains across Japan in hopes of spotting supernovae before anyone else. 


made his observations but did not have the 
time to follow up until later,” she says. 

Alex Filippenko, an astronomer at UC 
Berkeley, also missed the discovery. Dur- 
ing a backyard star-viewing party the night 
of 18 May, he says he imaged the Pinwheel 
galaxy with an 11-centimeter telescope “just 
for fun.” After hearing of SN 2023ixf’s dis- 
covery, Filippenko found that he had seren- 
dipitously captured the supernova 11 hours 
before Itagaki. “Had I closely examined my 
images, I would have discovered it even ear- 
lier than Itagaki!” he says. 

There is more at stake than bragging 
rights. “The sooner after the explosion that 
a supernova is detected, the more we can 
potentially learn about the exploding star 
and the explosion itself” Filippenko says. 
Four days after its discovery, SN 2023ixf 
peaked in brightness at about magnitude 
11—still too faint to be seen with the naked 
eye but as bright as Proxima Centauri, the 
Sun’s nearest neighbor. By then, numerous 
telescopes had tuned into the spectacle— 
including giants such as the 8.1-meter 
Gemini North telescope in Hawaii. NASA’s 
Hubble and JWST space telescopes are also 
observing the supernova, which will fade 
over the course of many months. 

Analysis of the light curve suggests the 
exploding star was a red supergiant hun- 
dreds of times bigger than the Sun. Having 
exhausted its nuclear fuel, the star’s core 
collapsed, launching a shock wave that blew 
apart the star and produced a cosmic light 
show. In 2002 images from the Hubble tele- 
scope, Soraisam and her colleagues found 
what they believe is the progenitor. But she 
says they won’t know for sure until “after the 
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supernova has faded and we see that the star 
has vanished.” 

Howell says the critical data from Itagaki 
and other amateurs indicate an early, extra 
brightening of SN 2023ixf that supernova 
models don’t predict. He points to three pos- 
sible explanations for the excess light: a pre- 
cursor explosion or eruption; dense shells of 
material around the star, lit up by the emerg- 
ing shock wave; or unusual emissions from 
a nonspherical progenitor star. It is unlike 
anything seen before, “probably because we 
almost never catch supernovae this early,’ 
Howell says. 

Itagaki’s previous discoveries suggested it 
might be possible to see signs of unrest on a 
massive star even before it explodes. In 2004, 
he spotted a bright object in a spiral galaxy 
77 million light-years away from Earth. No 
professionals verified his sighting before it 
disappeared 10 days later. Following a hunch 
it could be important, Itagaki periodically 
checked the location. In 2006, he (and two 
other amateur astronomers all working inde- 
pendently) discovered supernova 2006jc. 

The explosion produced unexpected x-rays 
that Andrea Pastorello, an astronomer at 
Italy's Astronomical Observatory of Padova, 
thought were a clue to the 2004 outburst. 
Pastorello and his team concluded that what 
Itagaki saw was the progenitor star shedding 
its outer layers in a burst of light. Those lay- 
ers lit up again with x-rays 2 years later when 
the supernova ejecta caught up and collided 
with the shed material. 

It was previously thought that stars were 
quiet before going supernova. Itagaki’s 
glimpse of that 2004 precursor event along 
with other observations is “really causing 


a big shift in our understanding of super- 
novae,” Howell says. 


FOR DISCOVERING and explaining the never- 
before-seen phenomenon, Pastorello’s team 
and Itagaki were rewarded with a paper in 
Nature in 2007. It was Itagaki’s first scientific 
publication, and supernova 2006jc remains 
his most memorable discovery. “But I had 
absolutely nothing to do with the paper; it 
just has my name on it,’ he says with that 
smile and self-deprecating wave. 

Since then, Pastorello has worked with 
Itagaki on five other papers. “He is very 
collaborative, and his archive is a precious 
source of pre-explosion data,’ Pastorello 
says. That collaborative spirit has earned 
Itagaki co-authorship on what he estimates 
are “a lot” of papers, though he doesn’t know 
how many. The Astrophysics Data System, 
operated by the Smithsonian Astrophysical 


© 


Observatory, credits Itagaki as a co-author . 


on 23 refereed publications and identifies 
scores more that cite his data. Papers from 
SN 2023ixf are likely to add to the tally. 

SN 2023ixf has started its long fade out, 
but Itagaki’s passion isn’t dimming. Al- 
though already “superaged” by Japan’s defi- 
nition, he sees no reason to alter his routine. 
He’s determined to catch a supernova go- 
ing off in the Andromeda galaxy, the Milky 
Way’s nearest neighbor, one that by cosmic 
averages is long overdue. He returned to 
his headquarters the night after finding SN 
2023ixf for another solitary vigil. 

“When I was in junior high school, I 
dreamed of building a hut, putting a big 
telescope in it, and having this life,” he says. 
“That dream has come true.” 
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An expanded view of transcription 


A new method expands chromatin to provide detailed images of transcription sites 


By Timothy J. Stasevich! and Hiroshi Kimura? 


ow promoters and enhancers acti- 

vate genes, and how they commu- 

nicate to facilitate transcription, 

are unknown. The main transcrip- 

tional RNA polymerase, called RNA 

polymerase II (RNA Pol IJ), was 
originally thought to travel along a linear 
stretch of DNA. However, evidence gath- 
ered over the years paints a more complex 
picture, in which enhancer and promoter 
DNA recruit RNA Pol II and genes to larger 
superstructures, sometimes referred to as 
transcription factories, hubs, or conden- 
sates (/-4). The precise organization of 
these superstructures remains elusive, leav- 
ing many questions unanswered. On page 
92 of this issue, Pownall et al. (5) describe 
a new form of expansion microscopy they 
call ChromExM, which they used to expand 
chromatin in zebrafish embryos to visual- 
ize transcription at high resolution. This 
expanded view provides evidence that pro- 
moters and enhancers come together dur- 
ing gene activation, only to be “kicked” 
apart soon after. 

ChromExM modifies PanExM (6)—a tech- 
nique to iteratively expand the full proteome 
using an entangled hydrogel mesh—to spe- 
cifically expand chromatin 16-fold. For this, 
ChromExM uses a mesh that is optimized 
for catching and fluorescently labeling DNA- 
bound biomolecules such as histones, the 
major protein constituent of chromatin. 
Confocal imaging was used to visualize the 
labels, achieving a net spatial resolution of 
~15 nm compared with the standard opti- 
cal resolution of ~200 nm (see the figure). 
Pownall et al. also used stimulated emis- 
sion depletion super-resolution microscopy 
to further zoom in on fluorescently labeled 
histones in the expanded zebrafish embryos, 
achieving a resolution of ~3 nm. 

Chromatin forms a complex polymer, so it 
was unclear if chromatin architecture would 
be preserved after expansion. Although one 
previous study suggested that isotropic ex- 
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pansion is possible (7), other studies have 
demonstrated substantial artifacts after ex- 
pansion (8). To test the ability of ChromExM 
to preserve chromatin architecture, Pownall 
et al. developed a method to optically etch 
equidistant parallel lines into chromatin. 
The lines remained essentially equidistant 
and parallel after expansion. They also con- 
firmed that finer chromatin ultrastructure 
is preserved after expansion by comparing 
in vitro clusters of nucleosomes, which are 
basic units of chromatin consisting of DNA 
and histones, imaged with electron micros- 
copy and ChromExM. 

During the maternal-zygotic transition 
stages, when zygotic transcription is ac- 


— 
2um 


Pownall et al. were branched, suggesting 
multiple intertwined genes, indicative of a 
transcription factory or hub. In total, there 
were around 800 RNA Pol II strings per 
nucleus. Pownall et al. also observed ~30 
larger “macroclusters” of RNA Pol II per nu- 
cleus, some connected to RNA Pol II strings, 
similar to the large and transient RNA Pol II 
clusters observed in living cells (3, 11). 
Intriguingly, the macroclusters were 
sometimes filled with clusters of the tran- 
scription factor Nanog, but those clusters 
were physically distinct from the RNA Pol 
II clusters, suggesting that they bind to dif- 
ferent sections of DNA. Pownall et al. hy- 
pothesized that those DNA elements were 


50 pm 


A fluorescent probe for DNA is used to visualize the nuclei from two different zebrafish embryos. The embryo 
on the right has undergone the ChromExM expansion process, providing a much more detailed view. 


tivated in a subset of genes, Pownall et al. 
observed that highly expressed microRNA 
gene clusters had hundreds of active RNA 
Pol II molecules arranged in a line. The 
authors also saw similar strings of RNA 
Pol II at nonspecific loci scattered across 
the nuclei. They interpreted each string as 
RNA Pol II molecules arranged in single 
file, generating nascent RNAs from a sin- 
gle gene. These strings are reminiscent of 
the RNA Pol II convoys that are predicted 
from models of transcription imaging in 
live cells (9). Previously, nascent RNAs 
have been observed arranged along string- 
like loops, but only in extremely long genes 
(10). Occasionally the strings observed by 


promoters and enhancers that were in close 
proximity before active RNA production 
by elongation. To test this hypothesis, they 
blocked elongation using an RNA Pol II in- 
hibitor. This led to a significant reduction in 
the number of RNA Pol II strings, a closer 
association between Nanog and RNA Pol I, 
and an increase in RNA Pol II at promoters. 
These data indicate that there is an increase 
in transient enhancer-promoter contact 
when transcription factors cluster together 
but transcription is not activated (12). 

This expanded view of transcription 
provides further support for the “kiss and 
run” model (3), where promoters and en- 
hancers make transient contact to activate 
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genes. Furthermore, given that elongation 
inhibition led to an increase in promoter- 
enhancer association (kissing), Pownall et 
al. refine the model to “kiss and kick” to 
emphasize the active role elongation seems 
to be playing in separating promoters and 
enhancers. If elongation truly kicks apart 
enhancers and promoters, then the kick 
provides a simple explanation for why tran- 
scription would occur in well-defined bursts. 
The existence of this kick is supported by 
other observations. Phosphorylation of 
RNA Pol II at the onset of elongation can 
convert the association of RNA Pol II from 
transcription initiation factor condensates 
to splicing factor condensates, which are 
required for the processing of transcripts to 
mRNA (4). Indeed, transcription coactiva- 
tors on enhancers and elongating RNA Pol 
II are physically separated (13, 14). 

Looking ahead, it will be important to 
devise more direct tests of the kiss and kick 
model by imaging specific promoters and 
enhancers during ChromExM. This could 
be done by combining ChromExM with 
techniques for in situ labeling of endoge- 
nous DNA elements. This experiment would 
help establish whether the macroclusters 
observed by Pownall et al. correspond to a 
specific set of genes or if they represent a 
stage that all activated genes pass through. 

Other questions should also be addressed. 
For example, can the RNA Pol II strings be 
seen in most genes without ChromExM, 
and what are their dynamics in live cells? It 
is also unknown how these findings relate 
to the three-dimensional organization of 
the genome, including chromatin domains 
and their stability, and whether these fea- 
tures are specific to developing embryos. In 
contrast to the early embryo—which has a 
rapid cell cycle and thus a robust transcrip- 
tion program that is executed in a short 
space of time—enhancer-promoter contact 
might not be so critical in somatic cells (15). 
Ultimately, it is exciting to see experiments 
beginning to converge on a unified model 
for transcriptional activation. 
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PLANT BIOLOGY 


Maintaining asymmetry 


in cell division 


Promoting asymmetric division through microtubule 


dynamics establishes cell fate 


By Camila Goldy and Marie-Cécile Caillaud 


n multicellular organisms, cell division 

contributes to the generation of special- 

ized cells. In walled organisms such as 

plants, the orientation of the division 

plane by which two daughter cells sepa- 

rate is of special importance because 
cells are embedded in a matrix and cannot 
relocate. To define the future cell division 
zone before mitosis, plants have evolved a 
specific structure called the preprophase 
band (PPB), which is composed of micro- 
tubules. In the context of asymmetric cell 
division, a polar domain is established at 
the plasma membrane to allow the differ- 
entiation of one of the daughter cells while 
maintaining a stem cell population. How 
plant cells establish the polar domain re- 
mains enigmatic. On page 54 of this issue, 
Muroyama et al. (1) reveal that a polarized 
destabilization of microtubules demarcates 
a region of the plant cell as incompat- 
ible with microtubule PPB establishment, 
thereby orienting asymmetric cell division 
and guiding the development of stomata 
(pores for gas exchange). 

During leaf development, stomata are 
produced in the surface cell layer by itera- 
tive asymmetric cell divisions. This process 
involves the conversion of a subset of pro- 
todermal cells into meristemoid mother 
cells that divide asymmetrically to produce 
two daughter cells of unequal volumes. The 
small daughter cell becomes a meristemoid, 
which ultimately produces guard cells sur- 
rounding the stomatal pore while the larger 
daughter cell [the stomatal lineage ground 
cell (SLGC)] further divides asymmetrically. 
The daughter cells of the SLGC eventually 
differentiate into puzzle-shaped epider- 
mal cells (pavement cells) while maintain- 
ing a controlled amount of meristemoids 
(2). During this process, a plasma mem- 
brane-associated polarity crescent defined 
by BREAKING OF ASYMMETRY IN THE 
STOMATAL LINEAGE (BASL) and BREVIS 
RADIX family (BRXf) proteins is formed 
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and maintained throughout asymmetric 
cell division (3). The orientation of the 
subsequent asymmetric cell division, de- 
fined by the BASL-BRXf polarity crescent, 
is tightly controlled to enforce the “one-cell 
spacing rule,” which ensures that stomata 
do not directly touch, thus optimizing gas 
exchange during photosynthesis while min- 
imizing water loss (4). 

To address the long-standing question of 
how the orientation of asymmetric divisions 
is determined and maintained, Muroyama 
et al. used stomatal lineage formation in 
Arabidopsis thaliana as a model. They 
showed that the cortical division zone (a 
landmark for the site of future division) and 
the microtubule PPB cannot form within 
the BASL-BRXf polarity crescent. This result 
agrees with earlier reports showing that the 
localization of BASL-BRXf is polarized be- 
fore PPB formation when cortical division 
zone components are initially recruited to 
the cortex (5-7). Within asymmetrically 
dividing cells in the leaf epidermis, the de- 
fault state for cell division orientation ap- 
pears to be the shortest wall (requiring the 
least energy consumption). However, when 
the shortest path conflicts with the one-cell 
spacing rule, these rules need to be broken. 
The authors report that the polarized BASL- 
BRXf domain is required to override default 
division patterns during formative asym- 
metric divisions and that this is altered in 
basl mutants. Reciprocally, the establish- 
ment of the microtubule PPB is required 
for stomatal patterning. In a mutant that 
no longer forms the microtubule PPB, the 
division plane often bisects the BASL-BRXf 
polarity crescent, leading to aberrant cell 
fate in the resulting daughter cells. 

How does the BASL-BRXf polar domain 
influence PPB positioning? Using time- 
lapse imaging in planta, Muroyama ez al. 
demonstrate that the BASL-BRXf polar 
domain locally depletes cortical microtu- 
bules from the plasma membrane. There, 
the BASL-BRXf polar domain induces mi- 
crotubule plus-end depolymerization rather 
than physically preventing microtubule po- 
lymerization into the domain per se. Based 
on these results, the authors conclude that 
polarization-mediated destabilization of the 
27 
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genes. Furthermore, given that elongation 
inhibition led to an increase in promoter- 
enhancer association (kissing), Pownall et 
al. refine the model to “kiss and kick” to 
emphasize the active role elongation seems 
to be playing in separating promoters and 
enhancers. If elongation truly kicks apart 
enhancers and promoters, then the kick 
provides a simple explanation for why tran- 
scription would occur in well-defined bursts. 
The existence of this kick is supported by 
other observations. Phosphorylation of 
RNA Pol II at the onset of elongation can 
convert the association of RNA Pol II from 
transcription initiation factor condensates 
to splicing factor condensates, which are 
required for the processing of transcripts to 
mRNA (4). Indeed, transcription coactiva- 
tors on enhancers and elongating RNA Pol 
II are physically separated (13, 14). 

Looking ahead, it will be important to 
devise more direct tests of the kiss and kick 
model by imaging specific promoters and 
enhancers during ChromExM. This could 
be done by combining ChromExM with 
techniques for in situ labeling of endoge- 
nous DNA elements. This experiment would 
help establish whether the macroclusters 
observed by Pownall et al. correspond to a 
specific set of genes or if they represent a 
stage that all activated genes pass through. 

Other questions should also be addressed. 
For example, can the RNA Pol II strings be 
seen in most genes without ChromExM, 
and what are their dynamics in live cells? It 
is also unknown how these findings relate 
to the three-dimensional organization of 
the genome, including chromatin domains 
and their stability, and whether these fea- 
tures are specific to developing embryos. In 
contrast to the early embryo—which has a 
rapid cell cycle and thus a robust transcrip- 
tion program that is executed in a short 
space of time—enhancer-promoter contact 
might not be so critical in somatic cells (15). 
Ultimately, it is exciting to see experiments 
beginning to converge on a unified model 
for transcriptional activation. 
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n multicellular organisms, cell division 

contributes to the generation of special- 

ized cells. In walled organisms such as 

plants, the orientation of the division 

plane by which two daughter cells sepa- 

rate is of special importance because 
cells are embedded in a matrix and cannot 
relocate. To define the future cell division 
zone before mitosis, plants have evolved a 
specific structure called the preprophase 
band (PPB), which is composed of micro- 
tubules. In the context of asymmetric cell 
division, a polar domain is established at 
the plasma membrane to allow the differ- 
entiation of one of the daughter cells while 
maintaining a stem cell population. How 
plant cells establish the polar domain re- 
mains enigmatic. On page 54 of this issue, 
Muroyama et al. (1) reveal that a polarized 
destabilization of microtubules demarcates 
a region of the plant cell as incompat- 
ible with microtubule PPB establishment, 
thereby orienting asymmetric cell division 
and guiding the development of stomata 
(pores for gas exchange). 

During leaf development, stomata are 
produced in the surface cell layer by itera- 
tive asymmetric cell divisions. This process 
involves the conversion of a subset of pro- 
todermal cells into meristemoid mother 
cells that divide asymmetrically to produce 
two daughter cells of unequal volumes. The 
small daughter cell becomes a meristemoid, 
which ultimately produces guard cells sur- 
rounding the stomatal pore while the larger 
daughter cell [the stomatal lineage ground 
cell (SLGC)] further divides asymmetrically. 
The daughter cells of the SLGC eventually 
differentiate into puzzle-shaped epider- 
mal cells (pavement cells) while maintain- 
ing a controlled amount of meristemoids 
(2). During this process, a plasma mem- 
brane-associated polarity crescent defined 
by BREAKING OF ASYMMETRY IN THE 
STOMATAL LINEAGE (BASL) and BREVIS 
RADIX family (BRXf) proteins is formed 
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and maintained throughout asymmetric 
cell division (3). The orientation of the 
subsequent asymmetric cell division, de- 
fined by the BASL-BRXf polarity crescent, 
is tightly controlled to enforce the “one-cell 
spacing rule,” which ensures that stomata 
do not directly touch, thus optimizing gas 
exchange during photosynthesis while min- 
imizing water loss (4). 

To address the long-standing question of 
how the orientation of asymmetric divisions 
is determined and maintained, Muroyama 
et al. used stomatal lineage formation in 
Arabidopsis thaliana as a model. They 
showed that the cortical division zone (a 
landmark for the site of future division) and 
the microtubule PPB cannot form within 
the BASL-BRXf polarity crescent. This result 
agrees with earlier reports showing that the 
localization of BASL-BRXf is polarized be- 
fore PPB formation when cortical division 
zone components are initially recruited to 
the cortex (5-7). Within asymmetrically 
dividing cells in the leaf epidermis, the de- 
fault state for cell division orientation ap- 
pears to be the shortest wall (requiring the 
least energy consumption). However, when 
the shortest path conflicts with the one-cell 
spacing rule, these rules need to be broken. 
The authors report that the polarized BASL- 
BRXf domain is required to override default 
division patterns during formative asym- 
metric divisions and that this is altered in 
basl mutants. Reciprocally, the establish- 
ment of the microtubule PPB is required 
for stomatal patterning. In a mutant that 
no longer forms the microtubule PPB, the 
division plane often bisects the BASL-BRXf 
polarity crescent, leading to aberrant cell 
fate in the resulting daughter cells. 

How does the BASL-BRXf polar domain 
influence PPB positioning? Using time- 
lapse imaging in planta, Muroyama ez al. 
demonstrate that the BASL-BRXf polar 
domain locally depletes cortical microtu- 
bules from the plasma membrane. There, 
the BASL-BRXf polar domain induces mi- 
crotubule plus-end depolymerization rather 
than physically preventing microtubule po- 
lymerization into the domain per se. Based 
on these results, the authors conclude that 
polarization-mediated destabilization of the 
27 
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microtubule plus ends renders a region of 
the plasma membrane incompatible with 
microtubule PPB formation, thereby ori- 
enting cell division. 

As in plant cells, division orientation in 
animal cells often prescribes daughter cell 
polarity by determining whether a cell in- 
herits a cortical polar domain. For division- 
plane specification in 
animal cells, the orien- 
tation of the nucleus 
and/or mitotic spindle 
emerges from _physi- 
cal constraints that are 
sensed by microtubules, 
leading to bisection of 
the spindle axis in cy- 
tokinesis. To bias geo- 
metrical rules to break 
symmetry, polar do- 
mains typically recruit 
or activate cytoskeleton- 
associated proteins at 
the cortex, yielding net 
forces that reorient the 
mitotic structures such 
as the astral microtu- 
bules, which extend from the spindle poles 
(8). In the context of asymmetrical division 
in plant cells, the study by Muroyama et al. 
suggests that the orientation of the asym- 
metric division is not obtained through an 
effect of astral microtubules, because plants 
do not have them, but rather through ef- 
fects on cortical microtubules transitioning 
toward the formation of a PPB before mito- 
sis. Moreover, in plant asymmetric cell divi- 
sion, the polar domain does not specify a 
position for cytokinesis but acts as a forbid- 
den landmark where microtubules cannot 
align, preventing the division plane from 
colliding with it. 

The proposed mechanism by which po- 
larity proteins could specify the division 
plane by creating a microtubule-depleted 
zone raises the questions of how the BASL- 
BRXf complex is recruited to the plasma 
membrane and how it excludes microtu- 
bules. Based on preliminary genetic evi- 
dence (9), BASL and BRXf are mutually 
required for the polarization of each other. 
In the current model, BRXf proteins re- 
versibly associate with the plasma mem- 
brane through palmitoylation at their N 
termini (9). BASL is recruited through an 
interaction between BASL and the BRX do- 
mains that define the BRXf proteins. The 
nature of the upstream signal that defines 
the location of BASL-BRXf polarization re- 
mains to be discovered. 

It is likely that the scaffold proteins BASL 
and BRXf recruit a microtubule-associated 
protein to affect microtubule dynamics 
rather than directly mediating plus-end de- 
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Super-resolved imaging shows guard 
cell cortical microtubules (pink) 
in stomata of Nicotiana benthamiana. 


polymerization. The critical microtubule- 
associated proteins might not be cell divi- 
sion specific because the same microtubule 
depletion is observed after division in SLGCs 
for as long as the polar domain persists. But 
how can the component responsible for the 
direct effect on microtubules be identified? 
The establishment of asymmetry in animal 
cells often involves both 
biochemical and me- 
chanical processes (8). 
Because microtubules 
were proposed to be 
tension sensors (JO), the 
mechanical constraints 
and subsequent signal- 
ing might be aspects to 
consider. 

During pavement cell 
differentiation, specific 
small guanosine _ tri- 
phosphatases (GTPases) 
called Rho of Plants 
(ROPs) act as master reg- 
ulators of cell polarity, 
mediating the recruit- 
ment of microtubule- 
associated proteins in the polar domain to 
shape cytoskeleton distribution. Upstream, 
ROPs are regulated by several stimuli, in- 
cluding the receptor-like kinase FERONIA 
(FER) pathway, which participates in sens- 
ing and/or transducing mechanical signals 
(11-13). Investigating the cross-talk between 
ROP signaling and symmetry breaking 
in the context of asymmetric cell division 
might be key in the future. Overall, the study 
by Muroyama e¢ al. highlights the singular 
mechanism used by plants to control the 
inheritance of fate regulators by connecting 
polarity with microtubule interactions. 
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he discovery of the Higgs boson in 

2012 marked the completion of the 

Standard Model (SM) of particle 

physics. The SM remains the best the- 

ory of the known fundamental parti- 

cles and how they interact, but there 
is evidence that the SM is incomplete. A 
shortcoming is that the SM predicts nearly 
equal amounts of matter and antimatter 
after the Big Bang, which would result in 
a cosmos full of light after the matter and 
antimatter particles mutually annihilate. 
The physicist Andrei Sakharov outlined 
the basic conditions that are needed to 
create the matter imbalance, called the 
baryon asymmetry of the Universe (BAU), 
that allows for our existence (7). One of the 
Sakharov conditions, time symmetry vio- 
lation (TSV), suggests that other particles 
and interactions beyond those of the SM 
await discovery. On page 46 of this issue, 
Roussy et al. (2) report the most stringent 
limits to date on the size of the electron’s 
permanent electric dipole moment (EDM), 
which may help explain the BAU. 

Time symmetry ensures that when we 
imagine reversing time, the physical laws 
follow their time-reversed paths. TSV is 
needed to generate the BAU because it is a 
necessary condition to convert antimatter 
to matter, which may have happened in the 
early Universe just after the Big Bang, leav- 
ing more matter than antimatter. After an- 
nihilation of particles with their antipar- 
ticles, the excess amount of matter formed 
the current Universe. The physics that 
describes the SM mostly obeys time sym- 
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microtubule plus ends renders a region of 
the plasma membrane incompatible with 
microtubule PPB formation, thereby ori- 
enting cell division. 

As in plant cells, division orientation in 
animal cells often prescribes daughter cell 
polarity by determining whether a cell in- 
herits a cortical polar domain. For division- 
plane specification in 
animal cells, the orien- 
tation of the nucleus 
and/or mitotic spindle 
emerges from _physi- 
cal constraints that are 
sensed by microtubules, 
leading to bisection of 
the spindle axis in cy- 
tokinesis. To bias geo- 
metrical rules to break 
symmetry, polar do- 
mains typically recruit 
or activate cytoskeleton- 
associated proteins at 
the cortex, yielding net 
forces that reorient the 
mitotic structures such 
as the astral microtu- 
bules, which extend from the spindle poles 
(8). In the context of asymmetrical division 
in plant cells, the study by Muroyama et al. 
suggests that the orientation of the asym- 
metric division is not obtained through an 
effect of astral microtubules, because plants 
do not have them, but rather through ef- 
fects on cortical microtubules transitioning 
toward the formation of a PPB before mito- 
sis. Moreover, in plant asymmetric cell divi- 
sion, the polar domain does not specify a 
position for cytokinesis but acts as a forbid- 
den landmark where microtubules cannot 
align, preventing the division plane from 
colliding with it. 

The proposed mechanism by which po- 
larity proteins could specify the division 
plane by creating a microtubule-depleted 
zone raises the questions of how the BASL- 
BRXf complex is recruited to the plasma 
membrane and how it excludes microtu- 
bules. Based on preliminary genetic evi- 
dence (9), BASL and BRXf are mutually 
required for the polarization of each other. 
In the current model, BRXf proteins re- 
versibly associate with the plasma mem- 
brane through palmitoylation at their N 
termini (9). BASL is recruited through an 
interaction between BASL and the BRX do- 
mains that define the BRXf proteins. The 
nature of the upstream signal that defines 
the location of BASL-BRXf polarization re- 
mains to be discovered. 

It is likely that the scaffold proteins BASL 
and BRXf recruit a microtubule-associated 
protein to affect microtubule dynamics 
rather than directly mediating plus-end de- 
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Super-resolved imaging shows guard 
cell cortical microtubules (pink) 
in stomata of Nicotiana benthamiana. 


polymerization. The critical microtubule- 
associated proteins might not be cell divi- 
sion specific because the same microtubule 
depletion is observed after division in SLGCs 
for as long as the polar domain persists. But 
how can the component responsible for the 
direct effect on microtubules be identified? 
The establishment of asymmetry in animal 
cells often involves both 
biochemical and me- 
chanical processes (8). 
Because microtubules 
were proposed to be 
tension sensors (JO), the 
mechanical constraints 
and subsequent signal- 
ing might be aspects to 
consider. 

During pavement cell 
differentiation, specific 
small guanosine _ tri- 
phosphatases (GTPases) 
called Rho of Plants 
(ROPs) act as master reg- 
ulators of cell polarity, 
mediating the recruit- 
ment of microtubule- 
associated proteins in the polar domain to 
shape cytoskeleton distribution. Upstream, 
ROPs are regulated by several stimuli, in- 
cluding the receptor-like kinase FERONIA 
(FER) pathway, which participates in sens- 
ing and/or transducing mechanical signals 
(11-13). Investigating the cross-talk between 
ROP signaling and symmetry breaking 
in the context of asymmetric cell division 
might be key in the future. Overall, the study 
by Muroyama e¢ al. highlights the singular 
mechanism used by plants to control the 
inheritance of fate regulators by connecting 
polarity with microtubule interactions. 


REFERENCES AND NOTES 


1. A.Muroyama, Y. Gong, K. S. Hartman, D.C. Bergmann, 
Science 381, 54 (2023). 
. X.Guo, L.Wang, J. Dong, New Phytol. 232, 60 (2021). 
. L.R.Lee, D.C. Bergmann, J. Cell Sci.132, jcs228551 
(2019). 
4. L.J.Pillitteri, J. Dong, Arabidopsis Book 11, e0162 (2013). 
5. A.Muroyama, Y. Gong, D.C. Bergmann, Curr. Biol.30, 
4467 (2020). 
6. Y.Gonget al., Development 148, dev199919 (2021). 
7. X.Guo, C.H. Park, Z.-Y. Wang, B.E. Nickels, J. Dong, Nat. 
Plants 7,667 (2021). 
8. N.lerushalmi, K. Keren, Curr. Opin. Cell Biol.72,91 
(2021). 
9. M.H. Rowe, J. Dong, A.K. Weimer, D.C. Bergmann, 
bioRxiv 10.1101/614636 (2019). 
10. O.Hamant, D. Inoue, D. Bouchez, J. Dumais, E. Mjolsness, 
Nat. Commun. 10, 2360 (2019). 
ll. C.Zhang etal., Curr. Biol.32, 518 (2022). 
12. W.Tangetal., Curr. Biol. 32,508 (2022). 
13. T.Lauster et al., Curr. Biol. 32,532 (2022). 


ACKNOWLEDGMENTS 

The authors thank Y. Jaillais and G. Ingram for their input 
onthe manuscript. M.-C.C. is supported by French National 
Research Agency grant ANR-20-CE13-0026-02.C.G. is 
funded by a European Molecular Biology Organization 
(EMBO) fellowship (ATLF 646-2021). 


10.1126/science.adi6664 


PHYSICS 


Probing 
fundamental 
particles with 
molecules 


Molecular spectroscopy 
constrains the size 

of the electron’s electric 
dipole moment 


By Mingyu Fan and Andrew Jayich 


he discovery of the Higgs boson in 

2012 marked the completion of the 

Standard Model (SM) of particle 

physics. The SM remains the best the- 

ory of the known fundamental parti- 

cles and how they interact, but there 
is evidence that the SM is incomplete. A 
shortcoming is that the SM predicts nearly 
equal amounts of matter and antimatter 
after the Big Bang, which would result in 
a cosmos full of light after the matter and 
antimatter particles mutually annihilate. 
The physicist Andrei Sakharov outlined 
the basic conditions that are needed to 
create the matter imbalance, called the 
baryon asymmetry of the Universe (BAU), 
that allows for our existence (7). One of the 
Sakharov conditions, time symmetry vio- 
lation (TSV), suggests that other particles 
and interactions beyond those of the SM 
await discovery. On page 46 of this issue, 
Roussy et al. (2) report the most stringent 
limits to date on the size of the electron’s 
permanent electric dipole moment (EDM), 
which may help explain the BAU. 

Time symmetry ensures that when we 
imagine reversing time, the physical laws 
follow their time-reversed paths. TSV is 
needed to generate the BAU because it is a 
necessary condition to convert antimatter 
to matter, which may have happened in the 
early Universe just after the Big Bang, leav- 
ing more matter than antimatter. After an- 
nihilation of particles with their antipar- 
ticles, the excess amount of matter formed 
the current Universe. The physics that 
describes the SM mostly obeys time sym- 
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metry; although there is TSV in the SM, 
it is not nearly enough to account for the 
BAU. TSV interactions arising from undis- 
covered particles can generate permanent 
EDMs of SM particles such as the electron. 
An EDM in an electric field causes an en- 
ergy shift when its orientation is flipped 
with respect to the field. An electron EDM 
(eEDM) violates time-reversal symmetry: 
If time were reversed, its magnetic mo- 
ment would flip and the EDM would not, 
looking fundamentally different from 
before time-reversal (see the figure). All 
eEDM measurements so far are consistent 


was anti-aligned to the molecule’s massive 
electric field. They probed the molecular 
energy shift that arises between the two 
configurations, allowing them to limit the 
eEDM to be no larger than 4.1 x 10° e cm. 
With their vacuum apparatus that fits on 
a table, they were able to probe for undis- 
covered particles up to a mass of 40 TeV, 
which is 10 times the energy of particles 
than can be observed in the Large Hadron 
Collider at CERN, which costs about $4.75 
billion to build and $1 billion to run an- 
nually. Considerable effort by Roussy et 
al. went into meticulously studying their 


Trying to understand the missing matter 

The Standard Model does not explain important problems such as the baryon asymmetry of the 
Universe (BAU), which requires undiscovered particles for additional time symmetry violation (TSV) 
interactions to convert antimatter to matter in the early Universe. Such particles may be discovered 

by precision measurements of the electron electric dipole moment, which is a TSV moment induced by 


an asymmetry of the charge distribution in the electron. 


Standard Model 


b, bottom; c, charm; d, down; e, electron, v,, electron neutrino; 
g, gluon; H, Higgs boson; 2, muon; v,,muon neutrino; -y, photon; 
s, strange; 7, tau; v,, tau neutrino; t, top; u, up; W, W boson; Z, Z boson. 


with zero, indicating that no particle has 
been found to induce enough TSV to gener- 
ate an eEDM that is within the current ex- 
perimental sensitivities. However, tighter 
experimental bounds on the eEDM probes 
increasingly higher particle masses, analo- 
gous to how higher-energy particle collid- 
ers can search for more-massive particles. 
Roussy et al. measured eEDM by preci- 
sion spectroscopy of HfF* molecules. They 
produced and trapped HfF* molecules in 
an ion trap and polarized them with a ro- 
tating electric field. The polarization con- 
trols the orientation of the large internal 
molecular field that the molecule’s valence 
electron experiences. With an additional 
applied magnetic field, lasers, microwaves, 
and radiofrequency fields, Roussy et al. 
measured the energy of HfF* molecules 
in two configurations: when the electron’s 
magnetic moment was aligned and when it 
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experimental apparatus and measurement 
technique so they could understand sys- 
tematic uncertainties in minute detail to 
ensure that no spurious signals were mis- 
takenly introduced (3). 

Sensitivity in the detection of the eEDM 
increased when molecules were used in- 
stead of atoms (4). The two previous eEDM 
limits come from brief measurements of 
many thorium oxide (ThO) molecules (5, 
6) using a cryogenic beam as compared 
to the trapped molecular ion approach of 
Roussy et al., in which fewer molecules 
are measured for much longer times. The 
combined ThO and HfF* measurements 
set more stringent limits on sources of TSV 
than either measurement does on its own. 
This is because measurements in different 
molecules have different sensitivities to 
underlying sources of TSV. In the event of a 
nonzero eEDM measurement, a combined 


analysis of multiple measurements in dif- 
ferent systems narrows down the contribu- 
tions from the individual TSV sources and 
provides more stringent bounds to test 
theories. Knowledge from EDM measure- 
ments across multiple systems would help 
guide the requirements of a future high- 
energy particle collider that could create 
the TSV particles responsible for the BAU. 

The SM predicts a very small eEDM of 
less than 10°** e cm, which is roughly a bil- 
lion times smaller than the limit reported 
by Roussy et al. This extremely small SM 
contribution to the eEDM gives future ex- 
periments operating at higher sensitivities 
plenty of room for discovery. In parallel 
with the ThO and HfF* eEDM experiments, 
there has been progress toward next-gen- 
eration TSV measurements that aim to 
improve sensitivity by using more-sensi- 
tive molecular species, larger numbers of . 
a given species, and longer measurement 
coherence times. These include using poly- 
atomic molecules (7), exotic isotopes with 
deformed nuclei (8), and even probing TSV 
in solids where larger numbers of atoms 
are trapped in a crystal (9). Technological 
improvements to allow such advances in- 
clude laser cooling and trapping of poly- 
atomic molecules (JO), spectroscopy of ra- 
dioactive molecules (JI), and fine-grained 
control of molecular ions with quantum 
logic spectroscopy (12, 13). A combination 
of such approaches could allow the detec- 
tion of high-energy particles perhaps at 
PeV energies (1 PeV = 10” eV), and along 
the way the sought-for undiscovered par- 
ticles may be detected. Such experiments 
will constrain high-energy theories beyond 
the SM. But upgrades of both the ThO 
beam experiment and the trapped molecu- 
lar ion technique, including a switch from 
HfF* to ThF*, are poised to enhance the 
particle-mass reach of eEEDM experiments in 
the near future. 
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SOLAR CELLS 


Tandems have the power 


Perovskite-silicon tandem solar cells break 
the 30% efficiency threshold 


By Stefaan De Wolf and Erkan Aydin 


ne way to increase the efficiency of 

a solar cell is to optimize it to the 

wide spectrum of sunlight for con- 

version into electricity. In this vein, 

a tandem solar cell may comprise 

a top cell that absorbs high-energy 
photons and a bottom cell that absorbs 
low-energy photons. Such a device can 
be fabricated as a stack of layers, with 
the top and bottom cell interconnected 
to each other. Monolithic integration of a 
perovskite solar cell and silicon solar cell 
into a tandem device is a promising path 
toward high-performance photovoltaics 
(PVs) at affordable cost. On pages 59 and 
63 of this issue, Chin et al. (1) and Mariotti 
et al. (2), respectively, report different ap- 
proaches to achieving power conversion ef- 
ficiencies (PCEs) in perovskite-silicon tan- 
dem solar cells of >30%. Overcoming this 
threshold provides confidence that high- 
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performance, low-cost PVs can be brought 
to the market. 

In 2022, the cumulative installed global 
PV capacity reached 1.2 TW (3). Yet to avert 
the catastrophic scenarios associated with 
global warming, the total PV capacity needs 
to increase to ~’75 TW by 2050 (4). Over the 
past decades, growth of the mainstream PV 
market, which is dominated by silicon tech- 
nologies, has largely been fueled by a system- 
atic increase in the performance of PV cells 
and modules (cells connected together in 
strings and packaged to form larger units), 
combined with enhanced module reliability 
and scaling for manufacturing. Collectively, 
these factors have driven down the levelized 
cost of electricity (costs over the system’s 
lifetime divided by its energy production) 
for PVs. Modern commercial silicon solar 
cells now reach PCEs >24% (5), and the best 
laboratory cell has a PCE of 26.8% (6). This is 
attributed to innovations such as surface and 
contact passivation—that is, a means to sup- 
press the recombination of charge carriers 
(electrons and holes). However, with a maxi- 
mum theoretical PCE of 29.4% (7), practical 
performance limits for a single-junction sili- 
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A laboratory-scale perovskite-silicon tandem solar, upd: 


is shown; the blue aperture defines the device area. 


con PV (a solar cell made from a silicon wa- 
fer as light absorber) are in sight. To sustain 
further decreases in levelized cost of electric- 
ity of commercial PVs, scalable higher-per- 
formance solar cell concepts with reliability 
that is equivalent to silicon products need to 
come to maturity quickly. 

Tandem solar cells are the most straight- 
forward route toward decreasing the level- 
ized cost of electricity, well beyond what 
is possible for single-junction solar cells. 
This is because tandem cells can drasti- 
cally lower charge-carrier thermalization. 
Thermalization is an energy exchange pro- 
cess of carriers that results in heat dissipa- 
tion, which limits the efficiency of a solar 
cell. As well, optical transmission loss lim- 
its the performance of single-junction so- . 
lar cells with a given bandgap energy, E,,. 
Thermalization losses are associated with 
incident photons with energy > E,; trans- 
mission losses are associated with photon 
energies < E,,. Tandem cells minimize these 
losses by stacking solar cells, positioning 
the cell with the larger bandgap toward the 
sunward side. With a bandgap of 1.12 eV 
and a proven track record as an affordable, 
mature PV technology, silicon is well suited 
to act as a bottom cell in a tandem device. 
The ideal top cell bandgap for a two-junc- 
tion tandem is in the 1.65 to 1.75 eV range. 

Metal halide perovskites have emerged 
in the past 15 years as a promising semi- 
conductor class for multiple optoelectronic 
applications, especially solar cells. Because 
their bandgap can be tuned to cover the 
range desirable for top cells, their integra- 
tion with silicon into tandems has quickly 
become an active area of research. Toward 
this end, several approaches for depositing 
perovskites onto silicon have been devel- 
oped that are based on solution processes, 
vacuum deposition, or a combination of ° 
both methods. 

Initial work on monolithic tandem cells 
focused on the development of low-tem- 
perature deposition processes for the top 
cell to guarantee compatibility with the 
high-efficiency bottom cell. The latter is 
often based on silicon heterojunction tech- 
nology. Tandem cells are now designed to 
collect electrons on their sunward side and 
holes on the rear side because this maxi- 
mizes electric current flow. Such tandem 
polarity makes use of specific contact lay- 
ers with minimized loss of current due to 
light absorption without charge generation 
(known as parasitic optical absorption) (8). 
To further enhance the current density, 
bottom cells with textured surfaces (both 
sunward and rear sides) have been increas- 
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ingly adopted to enhance light trapping (9). 
Nevertheless, to fully realize the potential 
of perovskite-silicon tandems, irrespective 
of how the perovskite is deposited, a main 
hurdle has been the minimization of charge 
loss from recombination of electrons and 
holes in the perovskite top cell, in particular 
at its interface with the electron-collecting 
contact stack at the light-facing side. This 
contact stack usually features aC, fullerene 
layer (an electron transport layer) that effi- 
ciently extracts electrons from the perovskite 
layer. In previous works, insertion of a thin 
inorganic film between the perovskite and 
C,, layer proved beneficial to minimize volt- 
age losses (8, 10) but left residual defects on 
the perovskite surface unpassivated. 

Both Chin et al. and Mariotti et al. report 
different strategies that resolve the issue of 
top contact passivation through molecular 
approaches, resulting in certified >30% ef- 
ficient perovskite-silicon tandems (see the 
figure). This efficiency threshold has been 
broken for different types of perovskite-sil- 
icon tandems, pointing to the broad design 
space of perovskite technology. 

In 2022, Chin et al. announced the first 
monolithic perovskite-silicon tandem to 
exceed the 30% efficiency barrier, with a 
certified PCE of 31.25% (11). They now de- 
scribe how they achieved this with a hybrid 
two-step perovskite deposition method 
that combines thermal evaporation of an 
inorganic template followed by a solution- 
based conversion of this scaffold into the 


perovskite phase. The conversion is accom- 
plished through the intercalation of organic 
constituents. This occurs on the textured 
sunward side of the silicon bottom cell that 
is textured on both sides. However, because 
of the high film conformality of this pro- 
cess, the top surface of the perovskite re- 
mains textured too. As a result, spin-casting 
(using centrifugal force) a solution onto 
this surface as a passivation approach is not 
directly applicable to this device structure 
because it would lead to inhomogeneous 
molecular coverage on the textured top 
surface. This would adversely affect device 
performance. Chin et al. overcame this 
challenge by blending a phosphonic acid- 
based additive [FBPAc (2,3,4,5,6-pentafluo- 
robenzylphosphonic acid)] to the precursor 
solution in the spin-casting step. The bulky 
FBPAc molecules are then expelled toward 
the top surface during perovskite crystal- 
lization, yielding larger crystal domains. 
Moreover, once expelled, FBPAc molecules 
passivated surface defects, reducing charge 
loss from recombination at the perovskite- 
C,, top interface. This substantially im- 
proved tandem performance. 

Following a different approach, Mariotti 
et al. show that the perovskite-C,, interface 
can also be passivated through insertion 
of piperazinium iodide, an ionic liquid, 
for tandems that use a solution-processed 
perovskite on a front-flat silicon bottom 
cell. Piperazinium iodide was chosen be- 
cause it contains electron acceptor and 


Tandem solar cell structures 


Atandem solar cell consists of a silicon bottom cell and a perovskite top cell. Perovskite absorbs blue light 
(high-energy photons) best, whereas silicon absorbs red light (low-energy photons). This combination 
maximizes the capture and conversion of sunlight into electricity more efficiently than that by single-cell types 
(single-junction solar cell). Contact layers are required to extract current from the tandem and need to be 
engineered toward maximal light-incoupling and minimal charge losses. Two new fabrication methods achieve 
cells with a power conversion efficiency of >30%. Both position a molecular passivant at the perovskite- 
electron transport layer interface of the front contact, minimizing charge recombination. 
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Hybrid-processed on textured 
A hybrid, two-step process deposits perovskite 
conformally onto the silicon cell. The device 


is textured on both the sunward and rear sides. 
The perovskite mimics the surface texturing. 
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Solution-processed on planar 
Aspin-casting solution process 
deposits onto a largely smooth 


silicon surface. The textured rear 
enhances light trapping of red light. 
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electron donor functional groups that 
can interact with positive and negative 
perovskite surface defects, respectively. 
This modification improved the energetic 
band alignment, reduced recombination 
losses, and enhanced electron extraction at 
the perovskite-C,, interface. The resulting 
perovskite-silicon tandem demonstrated 
impressive open-circuit voltages of up to 
2.00 V. The authors thus provide an expla- 
nation for their 2022 announcement of a 
perovskite-silicon tandem with certified 
PCE of 32.5%. 

As next steps, it remains important to fur- 
ther improve the performance of perovskite- 
silicon tandems that could achieve a PCE 
well above 35%. At the same time, focus 
is needed on improving device reliability. 
Arguably, the most critical factor lies in 
the annual degradation rate under actual 
outdoor conditions, which for perovskite- 
silicon tandems remains largely unknown. 
To be commercially viable, this degradation 
rate should be on par with mainstream PV 
technologies, which is less than 1% relative 
per year. Furthermore, scaling from the cur- 
rent 1 cm? record-device areas to >100 cm’, 
as is common in mainstream silicon manu- 
facturing, is also critical. Notably, a certified 
record PCE of 28.6% for a commercial-sized 
device was recently announced by Oxford 
PV (72). To be market competitive, in a high- 
throughput environment (manufacturing 
thousands of devices per hour), scaled tan- 
dems likely should show a net PCE benefit 
of at least 3% absolute on average, com- 
pared with the best commercial silicon cells 
(which is a continuously moving target). 

Reaching these performance, stability, 
and scaling goals in about 5 years—which 
is the time it may take for commercial sili- 
con PV to reach its practical PCE limit—is 
ambitious. But full engagement toward 
such ambitions would not only expedite the 
maturity of an exciting renewable energy 
technology but, more importantly, acceler- 


ate achieving the critical 75-TW global PV . 


capacity target. @ 
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Animal welfare: Methods to 
improve policy and practice 


New methods are emerging to quantify human and animal 
welfare on a common scale, creating new tools for policy 


By Mark Budolfson'2*, Bob Fischer*5, 
Noah Scovronick® 


here is growing international consen- 

sus that animal welfare is a crucial 

consideration in policy analysis, af- 

fecting domains ranging from food 

systems to biomedical research. 

Concern for animal welfare also fea- 
tures in many government regulations, 
certification programs, and institutional 
ethics codes across the globe and is central 
to many philanthropic and values-based 
investment decisions. However, although 
there are well-developed quantitative 
tools for incorporating human welfare 
into policy analysis, comparable tools for 
animal welfare are in their earliest stages. 
Without them, it is impossible to assess the 
net welfare impacts of a policy on humans 
and nonhumans alike on a common scale, 
which is crucial for making informed and 
transparent trade-offs (7). In practice, then, 
animal welfare is often ignored. Given that 
animal welfare matters in many cases, 
there is an urgent need for best-practice 
methods for integrating animal welfare 
into decision analyses. 

The goal of this policy forum is three- 
fold: first, to identify some important pol- 
icy questions involving animal welfare that 
remain unanswered; second, to crystalize 
the limitations of existing tools for mak- 
ing progress on these questions, owing to 
the difficulty of directly comparing human 
and animal welfare on a common scale (as 
well as comparisons of animals of differ- 
ent species on a common scale); and third, 
to outline new methods that are emerging 
to overcome these challenges. We hope 
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this analysis will accelerate refinement of 
these methods, increase interest in their 
deployment in policy analysis, and encour- 
age broad interdisciplinary collaboration 
toward that end. 


IMPORTANT POLICY QUESTIONS 
Several policy questions illustrate where 
animal welfare is of crucial importance but 
where the absence of a common scale for as- 
sessing welfare inhibits progress. 


Health and biomedicine 

Governmental agencies sometimes cull wild 
animals to reduce zoonotic disease transmis- 
sion [e.g., Australia’s culling of bats to reduce 
the threat of Hendra virus (2)]. When, if ever, 
does harm to wildlife provide sufficient rea- 
son to accept a higher level of human risk 
(3)? How should frameworks such as One 
Health and Planetary Health—which aim to 
improve the health of people, animals, and 
their shared environment—balance trade- 
offs within and across species? 

Animal models are standard in biomedi- 
cal research, though the rates at which ther- 
apies that work in animals translate to hu- 
mans are often poor (4). Given that existing 
regulations call on researchers to reduce the 
number of animals used, when possible, it is 
important to compare animal welfare costs 
with the expected value of research. How can 
this be done rigorously? 


Domesticated animals, food, 

and climate change 

No-kill animal shelters are popular because 
of public discomfort with euthanizing un- 
wanted pets. Should animal shelters be pub- 
licly subsidized to avoid having to euthanize 
pets—subsidies that would benefit animals 
but would divert resources from other hu- 
man interests? 

Sustainable intensification in agricul- 
ture involves improving farm productiv- 
ity without increasing inputs or land use. 
Unfortunately, many forms of sustainable 
intensification have negative implications 
for animal welfare, often due to high stock- 


6 


ing densities (5). When, if ever, is it be ee 


to increase environmental sustainability 110 — 
reduces animal welfare? 

Estimates of the social cost of carbon, 
which quantify damages from climate 
change in economic terms, have improved 
considerably over time as a wider range of 
harms have been incorporated (6). However, 
the welfare impacts from climate change on 
both wild and domesticated animals have 
largely been ignored, partly because of the 
difficulty of quantifying those impacts (7). 


Trade and taxation 
The General Agreement on Tariffs and Trade 
(GATT) protects free trade. Some countries 
have pushed for exceptions to block the im- 
port of low-welfare animal products (8). Is 
it appropriate to reduce economic efficiency 
for the sake of advancing animal welfare? 
There have been calls for taxes on low-wel- . 
fare animal products (9). However, without 
methods for quantifying animal welfare, it 
is unclear how to price negative welfare im- 
pacts from case to case. 


Conservation biology and resource 
management 
New Zealand’s Predator Free NZ initiative, 
which aims to eliminate all non-native pred- 
ators from the country, has been hotly de- 
bated because of some of the wildlife control 
methods on which it relies. When, if ever, do 
the welfare impacts of such population man- 
agement methods, such as anticoagulant 
poisons, outweigh the benefits of promoting 
the restoration of native ecosystems? 
During the COVID-19 lockdowns, many 
urban animals starved because of the inter- 
ruption of normal human activities. What 
level of risk, if any, should humans accept to 
provide assistance in such instances where 
there may be conflicts between the promo- 
tion of human and animal welfare? Which 
animals should be prioritized insofar as the 
aim is to minimize suffering? . 


Philanthropic investment and prosocial 
decision-making 

Given the prosocial goal of improving wel- 
fare as much as possible, how should indi- 
viduals, businesses with corporate social 
responsibility programs, and major charita- 
ble organizations divide resources between 
animal welfare and humanitarian causes 
such as the United Nations Sustainable De- 
velopment Goals? 


LIMITS OF EXISTING TOOLS 

These issues raise several challenges, two of 
which have been the main focus of animal 
welfare science over the past several decades. 
First, animal welfare science has tried to 
identify conditions that negatively affect the 
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Mapping animal welfare proxies 


The figure outlines 47 potential animal welfare proxies for which assessments can be scored, weighted, and aggregated to produce relative estimates of species’ 
welfare potential. Colored segments indicate whether the capacity is likely present based on existing evidence (12) (left). A simple method would be to give a binary (yes 
or no) score to each proxy and weight them all equally; species with a larger number of “yes” scores would imply a greater relative welfare potential in cross-species 
comparisons (right). More sophisticated methods of scoring, weighting, and aggregation are needed to reflect specific theories of welfare and to address uncertainties. 
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The proxies are as follows: 1- Taste-aversion behavior, 2— Reward-based learning, 3— Anxiety-like behavior, 4— Fear-like behavior, 5- Flexible self-protective behavior, 6- Response modified by painkillers, 7— Parental care, 8- 
Exhaustion-like behavior, 9- Associative learning from pain, 10- Self-medication, 11- Prioritizes pain response in relevant context, 12— Relief learning, 13- Play behavior, 14- Maternal response to offspring distress, 15- Valuing 
behavior, 16— Effects on exploratory behavior from pain, 17- Hyperalgesia, 18- Panic-like behavior, 19- Depression-like behavior, 20- Social buffering, 21- Displacement behavior, 22- Loneliness-like behavior, 23— Helping 
behavior, 24— Boredom-like behavior, 25- Emotional contagion, 26- Trace conditioning and pain, 27— Emotional reactions to learning, 28- Disgust-like behavior, 29- Friendship-like behavior, 30- Sensory-affective 
dissociation, 31- Curiosity-like behavior, 32- Joy-like behavior, 33— Posttraumatic stress disorder-like behavior, 34— Rescue behavior, 35- Play vocalization, 36- Concept of death, 37- Embarrassment-like behavior, 38— 
Envy-like behavior, 39- Guilt-like behavior, 40- Jealousy-like behavior, 41- Liking-wanting dissociation, 42- Love-like behavior, 43- Mourning-like behavior, 44— Pride-like behavior, 45- Sadness-like behavior, 46- Shame-like 


behavior, and 47— Sympathy-like behavior. 


welfare of individual animals, such as crowd- 
ing of caged or penned animals or restric- 
tions that prevent the freedom to engage in 
species-specific natural behaviors. Second, it 
has tried to identify strategies for mitigating 
these negative effects (e.g., providing envi- 
ronmental enrichment for farmed animals). 
These goals have resulted in the generation 
of considerable new knowledge about the 
needs of animals and how to improve their 
welfare, as shown by influential frameworks 
such as the Five Domains model and the 
Welfare Quality model, which are represen- 
tative of a rich literature that identifies fac- 
tors relevant to animals’ quality of life. 

However, these frameworks cannot an- 
swer the policy questions described above. 
First, they are silent on the relative impor- 
tance of different types of welfare impacts 
on individual animals even within a single 
species, preventing overall judgments about 
the relative goodness or badness of policy 
options. For example, rearing chickens bred 
to grow more slowly lowers the incidence of 
keel bone fractures but also increases the 
amount of time that chickens spend in in- 
tensive confinement: How should we evalu- 
ate the trade-off between the duration and 
intensity of suffering? Likewise, a move from 
caged to cage-free systems for chickens im- 
proves freedom of movement but results in 
a higher mortality rate due to higher inci- 
dences of smothering and injurious pecking. 
When is this trade-off acceptable? 

Second, even if these frameworks were 
extended to address these intraspecies chal- 
lenges, they would still be silent on how to 
compare the overall welfare impacts on 
members of one species with the overall wel- 
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fare impacts on members of another species, 
including humans. This issue is a crucial im- 
pediment to answering questions of the type 
listed above, given that increased benefits to 
farmed or wild animals are often only possi- 
ble at some cost to humans or to each other. 
What is needed, therefore, is a method for 
aggregating, on a single scale, the net effects 
of a policy on the welfare of both animals 
and humans alike (as well as animals of dif- 
ferent species). This would enable principled 
intraspecies and interspecies welfare com- 
parisons, which are the key challenges for 
next-generation animal welfare methods. 


NEXT-GENERATION METHODS 

Researchers are beginning to develop next- 
generation methods for quantifying and 
comparing both animal and human wel- 
fare on a single scale [e.g., (7, 10-12)]. Work 
on these methods is expanding rapidly, 
fueled by increasing recognition of their 
importance for answering policy ques- 
tions such as those listed above. As a re- 
sult, the time is ripe for researchers across 
disciplines to join the effort to develop 
and refine these methods. Useful collabo- 
ration might include not only refinement 
of methods to quantify animal welfare but 
also careful formulation of policy ques- 
tions where animal welfare is relevant, as 
well as clear articulation of the needs of 
policy analysts for incorporating animal 
welfare estimates. 

There are several strategies for making 
progress. Some methods focus on anthropo- 
centric valuations of animal welfare, such as 
consumer willingness to pay for given wel- 
fare improvements (13, 14). Others try to pro- 


vide intrinsic valuations of animal welfare by 
combining measurable empirical indicators 
that may be correlated with welfare potential ‘ 
across species (10, 12), where welfare poten- 
tial is a measure of how good and bad the life 
of an individual of a given species can be. For 
example, given the differences between them, 
including differences in their respective cog- 
nitive capacities and degrees of sociality, a pig 
can likely suffer more than a python (72). 

Estimates of welfare potential might be 
based on hypothesized neural or other prox- 
ies for conscious states, cognitive and behav- 
ioral traits, or more complex multidimen- 
sional proxies (see the figure). For instance, 
some researchers have proposed defining a 
function directly linking neuron counts in the 
forebrain to welfare potential (70), resulting ‘ 
in a single score per species that can be used 
to make interspecies welfare comparisons. 
Alternatively, other research groups have + 
proposed multidimensional proxies that can 
also be aggregated to arrive at a single overall 
welfare potential score (12). Analogous meth- 
ods are used in economics to define functions 
from income (or other proxies) to individual 
human welfare. Though they are imper- 
fect and must be used judiciously, keeping 
broader moral, legal, and political consider- 
ations in mind, such proxies can be valuable 
tools in some decision contexts. 

Consider, for instance, two policies that re- 
duce greenhouse gas emissions from animal 
agriculture by the same amount. Both focus 
on intensification, but one targets pork and 
the other targets chicken. With estimates of 
the welfare potential of each species, assess- 
ments of how their quality of life will change 
under each policy, and the number of ani- 
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mals affected, it becomes possible to estimate 
which policy is better for animal welfare. 
Valuations that can inform policy may 
combine the results of several methods be- 
cause there are many uncertainties to navi- 
gate. There are anumber of dimensions about 
which there may be disagreements over the 
extent to which those dimensions contribute 
to welfare (see the figure). Analogous em- 
pirical and normative challenges have been 
addressed in other policy domains, such as 
what discount rate to use for analyses that 
require long time horizons (e.g., in climate 
policy) or how to value human mortality and 
morbidity. Sometimes addressing these chal- 
lenges requires methods such as sensitivity 
analysis—where policies are evaluated across 
a range of normative and empirical views— 
that allow progress while acknowledging the 
existence of important disagreements. In 
the discounting debate, for instance, sensi- 
tivity analyses test how the optimal climate 
policy might differ given different assump- 
tions about how much additional dollars 
of avoided climate damages generate well- 
being for the poor compared with the rich. 
Analogously, sensitivity tests reveal how 
the optimal policy regarding agricultural 
practices might differ given higher welfare 
potential estimates for animals compared 
with lower estimates. For example, some 
approaches estimate high welfare potential 
for chickens (12), whereas others estimate 
a much lower value (J0). Some policies may 
be robustly recommended under either ap- 
proach, such as low-cost interventions that 
greatly improve quality of life in intensive 
chicken farming operations. Other policies 
may not be as robust across approaches, 
such as relatively expensive interventions 
that have a smaller impact on quality of life. 
Still, given the importance of animal wel- 
fare, some policy guidance is preferable to 
none at all. Although research continues to 
uncover the intelligence, sociality, and affec- 


tive capacities of animals, there is evidence 
to suggest that important gradients in such 
capacities exist across species. Moreover, as 
researchers devote more concerted attention 
to the challenges associated with interspe- 
cies trade-offs, progress can be made on how 
to provide actionable guidance to policy ana- 
lysts, decision-makers, and stakeholders. 


ACCELERATING APPLICATION 
With the preceding discussion in mind, we 
offer the following recommendations. 


Research across disciplines to develop and 
refine methods 

Methods for making quantitative interspe- 
cies welfare estimates are in their infancy. 
Welfare economists, empirically minded 
philosophers, and animal welfare scientists 
have begun researching these methods, but 
this work could be substantially enriched 
by deep collaborations among diverse re- 
searchers, including comparative psycholo- 
gists, neuroscientists, population biologists, 
and many others. 


Prioritize readily measurable empirical 
correlates of welfare 

To enable interspecies comparisons, these 
correlates need to be specific enough that 
they represent genuine proxies for wel- 
fare but general enough that they can be 
applied to a wide range of species, thus 
allowing for sufficiently comprehensive 
analyses. Examples of correlates include 
the extent to which responses to noxious 
stimuli are modified by painkillers, which 
provides evidence for the ability to feel 
pain (see the figure). 


Develop flexible tools that are sensitive to 
normative disagreements 

Policies that improve overall welfare must 
generally make some trade-offs. Decision 
analysis tools are often designed to rank 


Emerging methods can be used to estimate welfare impacts on livestock of high stocking density. 
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policies as better or worse, such as many 
of the integrated assessment models that 
evaluate climate, air quality, or food sys- 
tems policies. Accordingly, they make 
normative assumptions that can be con- 
troversial, such as those described above 
about discount rates or how to value mor- 
tality or morbidity. The same issue will 
arise when researchers incorporate animal 
welfare into such decision analysis tools 
or when developing new tools focused on 
animal welfare. Given important disagree- 
ments about the relative importance of hu- 
man and animal welfare and the relative 
importance of different animals’ welfare, 
researchers need to develop transparent 
tools that allow stakeholders to determine 
whether results are robust across reason- 
able ethical positions in the decision con- 
text. It is crucial to develop methods for 
incorporating animal welfare directly into . 
such decision analyses so that they can 
serve their intended functions (i.e., assess- 
ing policies as better and worse). 


Discuss animal welfare in mainstream 

policy analysis ‘ 
Animal welfare has long been neglected 
in policy choices, despite its ethical im- 
portance and the public’s concern for it. 
However, it will be harder to ignore in the 
future because so many actions have im- * 
pacts on animals and attitudes are steadily 
shifting in proanimal directions (15). As 
policy debates continue, it is essential 
to highlight impacts on animals as one 
among many relevant considerations. ® 
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The future of bears 
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A deep dive into the eight living bear species emphasizes 


humanity’s role in their fate 


By Ozgiin Emre Can 


nacold but bright day in the winter 
of 2015, I found myself near the bor- 
der between Nepal and Tibet ques- 
tioning my decision to embark on a 
field study in one of the most remote 
places on Earth. The challenges and 
risks inherent in the work of a conservation 
biologist are not trivial. Yet as I 
contemplated how I had come to 
find myself in this position, I no- 
ticed three children playing next 
door with an unexpected object: 
a fluffy teddy bear. I snapped a 
photograph that I would later 
use to remind myself and my 
team of the importance of our ef- 


product of a long journey the author took 
around the world to see the planet’s eight 
bear species in their natural habitats and 
to interview researchers who study them. 
The book’s chapters are organized accord- 
ing to Dickie’s travel route, which began in 
South America, where she learned about 
spectacled bears, more commonly known 
as Andean bears (Tremarctos ornatus), in 
Ecuador and Peru; then moved 
to Asia, where her investigations 
turned to sloth bears (Melursus 
ursinus) in India, giant pandas 
(Ailuropoda melanoleuca) in 
China, and sun bears (Helarctos 
malayanus) and moon bears, or 
more accurately Asiatic black 
bears (Ursus thibetanus), in 


forts to build a better future for Eight Bears: Vietnam; and then to North 
people and for wildlife. tla re bal America, where she explored 
Three years after completing UEhde Dee: re the lives of American black 


that Himalayan field study, I felt 
similar surprise and joy while 
reading Eight Bears, which tells the stories 
of the planet’s eight extant bear species and 
includes a delightful diversion into the ori- 
gin of the teddy bear. The book, written by 
journalist Gloria Dickie—“a child obsessed 
with wild creatures” and an alumna of 
the University of Colorado Boulder—is the 
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bears (Ursus americanus) and 
brown bears (Ursus arctos) in the 
United States and polar bears (Ursus marit- 
imus) in Canada. 

This well-researched book is a pleasure to 
read and is packed with facts many readers 
will be surprised to learn. For example, in 
Britain, the royals once had a “Master of the 
Bears” to manage activities related to brown 
bears in London, and bears were used in 
bearbaiting—an activity that involved forc- 
ing bears to fight with a pack of aggressive 
dogs—until the British Parliament banned 


¢ 


Human food sources, such as dumpsters, upd: 


increase the potential for human-bear conflict. 


the practice in 1835. However, bear danc- 
ing—in which a tame bear is coerced to 
perform for entertainment purposes—was 
allowed to continue in Britain until 1911. 
Meanwhile, Dickie recounts how, in America, 
the explorer William Clark wrote in his diary 
about “a verry large and a turrible looking 
animal” in 1805, and how dozens of brown 
bears were killed during Clark’s famous ex- 
pedition with Meriwether Lewis. The book 
also introduces readers to the mythology 
that has long surrounded bears, including 
a folktale in which an Andean bear assumes 
the disguise of a human man in order to kid- 
nap young women. Through these and other 
stories, the author succeeds in portraying the 
diversity of relationships people in different 
cultures have established with bears over . 
thousands of years. 

Dickie also has a good understanding of 
the problems occurring between bears and 
the people who coexist with them. In North 
America, she reveals, the availability of an- 
thropogenic food sources is the main driver 
of conflicts with American black bears and 
brown bears. In Asia, meanwhile, expansion 
of people into bear habitats and habitat loss 
are major drivers of conflicts with Andean 
bears and sun bears. Dickie describes how 
sloth bears sometimes cause injuries and fa- 
talities, resulting in retaliatory killings. As 
she rightly concludes, human-bear conflict, 
whether perceived or real, is shaping the fu- 
ture of bears. 

Dickie’s detailed, accurate, and holistic 
view of human-bear relationships is im- 
pressive. Even bear researchers can fall 
short in this regard, focusing on narrow 
scientific questions with little consider- 
ation of the big picture. The book’s greatest 
strength thus lies in how the author taps 
into the wisdom of multiple, internation- 
ally recognized researchers and how she + 
digests and aggregates their collective in- . 
sights to give readers a comprehensive un- 
derstanding of the lives of modern bears. 
The only things missing, in my opinion, 
were a world map showing the range of all 
eight bear species and the locations of the 
field sites the author visited; representa- 
tive photographs of each species; and pho- 
tographs of people who coexist with bears 
around the globe. 

In Eight Bears, Dickie succeeds in meet- 
ing her goal of presenting “an unromanti- 
cized view of bears’ tenuous position in the 
Anthropocene.” “Losing bears,” she notes, 
“would mean we lose a beautiful and com- 
plex relationship that has paralleled our 
own journey in this world.” 

10.1126/science.adi4084 
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ANTHROPOCENE 


Altering Earth in our image 


A new documentary series traces humanity’s profound 


effects on the world around us 


By Mary Ellen Hannibal 


hane Campbell-Staton is an evolu- 

tionary biologist at Princeton Uni- 

versity, but you might mistake him 

for a seasoned journalist in Human 

Footprint, a six-part series set to air 

weekly on PBS beginning on 5 July 
2023. The series, hosted by Campbell- 
Staton, reveals how, in our relatively brief 
history, humankind has wrought profound 
transformations to ecosystems, to other spe- 
cies, and even to our own DNA. 

Human Footprint begins with an episode 
dedicated to species that have been intro- 
duced into ecosystems where they did not 
evolve. Viewers watch as Campbell-Staton 
gamely rides along with snake hunter Donna 
Kalil in the Everglades, on the hunt for non- 
native pythons released into the wild via the 
Florida pet trade. The prodigious python eats 
prey as large as deer and is busily denuding 
this delicate wetland of its historic denizens. 

Kalil likens pythons to aliens. They regen- 
erate lost teeth and can regrow hearts and 
stomachs, the latter of which they sometimes 
fill with an alligator. “Once gators are on the 
menu,’ Campbell-Staton observes, “you’ve 
reached official gangster status.” 

How exactly to deter invasives is also 
a question asked about feral horses. Wild 
horses are descended from ancestors bred 
by humans for speed and quick reproductive 
turnover. Modern horses can double their 
population every 4 to 5 years, and prolifera- 
tion of these beautiful animals is devastating 
the American West. 


The reviewer is the author of Citizen Scientist: Searching for 
Heroes and Hope in an Age of Extinction (The Experiment, 
2016). Email: maryellenhannibal@gmail.com 
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Although state and federal agencies use 
lethal deterrence to cull harmful wildlife 
populations, they are restricted by law from 
controlling horses this way. The reasoning is 
not rational. As author David Philipps puts 
it, horses are woven “very deeply into our 
idea of who we are ourselves,’ and of course 
we do not sanction shooting people either. 
Campbell-Staton reflects that “our desire to 
keep wild horses in the West might be rooted 
in our culture, but its outcome is biological.” 
In the Anthropocene, this profound truth is 
evident everywhere. 

In a nicely calibrated segue, Campbell- 
Staton takes on predation in the next epi- 
sode. “Our ancestors were hunters 


6 


Chec 


Shane Campbell-Staton examines upde 


“The Gift of Maize” in Tlaxcala, Mexico. 


as many as before the war, when elephant 
tusks were targeted for their high monetary 
value. [He has published peer-reviewed sci- 
ence documenting this phenomenon (J).] 

Is humanity an overwhelming force of 
destruction? Not if you are a dog. Episode 
3 details the evolution of wolf descendants 
to “Man’s Best Friend.” Biologist Brian Hare 
hypothesizes that wolves adapted them- 
selves to humans, rather than the other way 
around. Today, that looks like an excellent 
decision. There are ~250,000 wild wolves 
on Earth at present, many of them perse- 
cuted by humans. Their descendants that 
took another genetic path now number 
around 1 billion, and many of them are liv- 
ing large indeed. 

Episodes 4 and 5 delve further into the in- . 
timate intertwining of humanity with species 
that we have wittingly or unwittingly taken 
along on a wild ride of human domination. 
Here, viewers learn that livestock domesti- 
cated by humans for our consumption out- 
weigh all wild mammals and birds combined 
by a factor of 10. In one sense, these species 
won the lottery of survival, but chickens, for 
example, probably would not consider the 
efficient ways we breed, raise, and slaughter 
them much of a prize. 

The problem of how to feed so many of 
us without subjugating wild species is not 
addressed, but Campbell-Staton does cover a 
nifty effort to restrain domestic cats, who kill 
an estimated 1 to 3 billion birds and 7 to 10 
million small mammals per year. These crea- 

tures, “the worst invasive species 


before they were even humans,’ he 
tells viewers, remarking that pur- 


Human Footprint 


alive,’ might one day be deterred 
by AI algorithms in collars that rec- 


suing prey has shaped our bodies, ene ognize when a cat is about to hunt : 
brains, and culture. PBS, Wednesdays, and emit a bird alarm. 

Our decimation of whale popu- 5 July to The final episode of Human 
lations in pursuit of oil for fuel and 9 ee — Footprint considers cotton, and 


food, he reveals, resulted in eco- 
system depletion that continues to 
reverberate in today’s oceans. “When whales 
are not there,’ says biologist Nick Pyenson, 
“they are not fertilizing the ocean.” Food 
webs are decimated, and the ocean’s chem- 
istry is altered. 

At Mozambique’s Gorongosa National 
Park, Campbell-Staton looks for elephants 
with ecologist Dominique Goncalves. 
Goncalves runs a program to help elephant 
populations rebound after a devastating 
civil war reduced their numbers from 2500 
to 200 in 16 years. Here, Campbell-Staton 
employs his evolutionary biologist chops to 
help explain how intensive hunting by hu- 
mans changed elephant genetics. Half the 
females born to survivors of this elephan- 
ticide were tuskless, he reveals, three times 


(check local listings) 


here 
somewhat from the narrative ap- . 
proach he has taken thus far. The fate of 
more than 5 million West Africans stolen 
from their homeland to farm cotton in the 
American South had more than economic 
and ecological impacts. Campbell-Staton 
visits a geneticist from 23andMe, where 
massive genetic datasets show that more 
than a significant percentage of contempo- 
rary African Americans have white, male 
ancestry. The specter of such widespread 
and systematic rape adds a new dimension 
to the series’ repeated mantra: “What we 
do reveals who we truly are.” & 


Campbell-Staton departs ° 


REFERENCES AND NOTES 
1. S.C.Campbell-Staton et al., Science 374, 483 (2021). 


10.1126/science.adi8075 


science.org SCIENCE 


Ye, 
OWN 
sy 


S 
— 


N 
Wy 
R.,SGG 


SSRN 
DS. SS 
wou 
RINNE 


SSS OOS 
aS 


FE 
-~ T 


SSeS 
SSS 


\N 


NN 


LETTERS 


science.org SCIENCE 


24 = 7JULY 2023 © VOL 381 ISSUE 6653 


NEXTGEN VOICES 


Succinct discoveries that inspire 


We asked young scientists to “name a discovery that inspired you and then 
describe it in exactly six words.” Read a selection of their responses here. Follow 
NextGen Voices on Twitter with hashtag #+NextGenSci. —Jennifer Sills 


NEUROSCIENCE 


Prechtl’s general 
movements 

Non-invasive neuro diagnosis. 
Baby approved! 


Maria Lucrecia Cuneo, 
Ciudad Auténoma de Buenos Aires, 
Argentina 


Columnar organization 
of the neocortex 
One algorithm to rule them all 


Guthrie Phineas Dyce, 
Canberra, ACT, Australia 


The neuron 
Ramon y Cajal’s 
“enigmatic thought cell” 


Matheus de Castro Fonseca, 
Pasadena, CA, USA 


IMMUNOLOGY 


Monoclonal antibodies 
The immune system's ninja 
assassins, weaponized 
Xuan Zhang, Guangdong, China 


Autoantibodies 
The pirates of the immune system 
Ranjeet Singh Mahla, Oxford, UK 


Immune basis of allograft 
rejection 

Graft acceptance or rejection? 
Host's choice! 

Jen Li, Westmead, NSW, Australia 


INFECTIOUS DISEASE 


Ebola vaccine 
Like cure in “Outbreak” 
movie, but real 


Gina Pontes, 
Capital Federal, Buenos Aires, 
Argentina 


Smallpox vaccine 
Jenner's vaccine: Cowpox’'s 
triumph, smallpox’s defeat 


Yanpei Mai, Guangzhou, Guangdong 
Province, China 


Rinderpest vaccine 
One shot to eradicate a plague 


Anke Briining-Richardson, 
Yorkshire, UK 
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Pandemic disease triangle 


Disease: Intersection of microbe, 


environment, immunity 


Gabriela Rios-Sotelo, Portland, 
OR, USA 


BIOLOGY 


Histopathology and 
immunohistochemistry 
Rainbows under the 
microscope allow diagnosis! 


Melissa T. Duong, 
Philadelphia, PA, USA 


Epimorphic regeneration 
Salamanders regrow amputated 
limbs. Can humans? 


Leonardo M. R. Ferreira, 
Charleston, SC, USA 


Adaptation to hypoxia 
Normal hemoglobin level among 
Sherpa population 

Matiram Pun, Calgary, AB, Canada 


Polymerase chain reaction 
Technique to amplify DNA; 
thanks, bacteria! 


Anurup Mohanty, 
Bangalore, Karnataka, India 


GENETICS 


Mendelian randomization 
Nature’s RCTs are cheaper 
than humans’! 

Haoxian Tang, Guangdong, China 


CRISPR-Cas genome base 
editors 

Editing DNA like a Word 
document 

Jasleen Gill, Media, PA, USA 


Genetic influence on eye 
movements 

Where | look = who | am 

Ben de Haas, Giessen, Germany 


ECOLOGY 


Virtual water 

Commodity trading: Equivalent 
to water diversion! 

Yuan Zhi, Guiyang, Guizhou, China 


Belowground kin recognition 
Plants: 
Where family roots run deep! 


Harihar Jaishree Subrahmaniam, 
Aarhus, Denmark 


Transgenic plants 
Playing God? 

I'm just improving crops! 
Mateo Daniel Ordofiez Rivera, 
Quito, Pichincha, Ecuador 


Antifungal dual-use 
Antifungals heal; 
but also, fungicides kill. 


Norman van Rhijn, 
Manchester, Lancashire, UK 


SOCIAL SCIENCE 


Social discounting 
Generous? Maybe. 

But every choice is biased. 
Fernanda S. Oda, Lawrence, KS, USA 


Gender role congruity 
theory 

Gender roles shape leaders’ 
perceived competence. 


Lena Rudat, Mannheim, Germany 


Problem-based learning 
Code error fixed, lessons 
acquired: Jackpot! 

Ming Cheng, Hefei, Anhui, China 


CHEMISTRY 


Ciamician’s sunlight 
photochemistry 
From the rooftops 

to the benchtops 


Terry McCallum, Ottawa, 
ON, Canada 


Sabatier principle 
Optimal catalysis: 
Neither strong nor weak 


Hao Zhang, Oxford, 
Oxfordshire, UK 


Fusion ignition 
Limited energy to unlock 
limitless energy 


Rishi Jai Patel, 
Philadelphia, PA, USA 


Field-driven colloidal 
assembly 

You can treat nanoparticles 
like LEGOs! 


Ahmed Al Harraq, 
Princeton, NJ, USA 


PHYSICS 


Quantum computing 
A cat-qubit appeared 

in my dreams. 

Ginevra Fulco, Milan, Italy 


Quantum cosmology 
Supersymmetric wave function 
of the universe 

Paulo Moniz, Covilha, Portugal 


ENGINEERING 


Generalized Nash 
equilibrium 

Cutting cake’s no piece of cake. 
Yuanxing Xia, Nanjing, Jiangsu 
Province, China 


Mass customization 
Economies of scale, ¢ 
tailored for everyone 


Adrian Tymes, 
Mountain View, CA, USA 


Bruun rule : 
Sandy coasts under threat 

of erosion 

Xiangming Cao, 

Nanjing, Jiangsu, China 


COMPUTER SCIENCE 


Tortured phrases detector 
Detecting the footprints of 
paraphrased plagiarism 


Houcemeddine Turki, ¢ 
Sfax, Tunisia 


Moravec’s paradox 
Al: “It's hard to fold laundry!” 


Ju Wen, Chengdu, 
Sichuan, China 


GEOGRAPHY 


Arctic and Tibetan Plateau 
teleconnection 
Cryospheric changes 

affect the whole world! 

Xin Wan, Chaoyang District, 
Beijing, China 

Hu Line 

Geographical dividing line for 
understanding China 

Bing Xue, 

Shenyang, China 


10.1126/science.adi8737 
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Image of bacteria with helically 
shaped appendages known as flagella, 
which enable cell movement. Flagellin, the 
protein that forms the core of flagella, plays 

a key role in how host cells respond—or do not 
respond—to pathogens and commensals. 


id 
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GRAND PRIZE WINNER 
Sara Clasen 


Sara Clasen 
received her 
undergraduate 
degree from 
the University 
of Arizona in 
Tucson anda 
PhD from Johns Hopkins School 
of Medicine in Baltimore, USA. 
She is currently a postdoctoral 
fellow in Ruth Ley’s Depart- 
ment of Microbiome Science 
at the Max Planck Institute for 
Biology, Tubingen, Germany. 
Her research examines im- 
mune responses to commensal 
microbiota. www.science.org/ 
doi/10.1126/science.adi6265 
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MICROBIOME 


The sound of silence 


How flagellins from commensals evade innate immunity 


By Sara Clasen 


n 1926, the Journal of Pathology and 
Bacteriology published the immuno- 
logical notes of a little-known diphthe- 
ria researcher, A. T. Glenny. In table 
XLIII of that publication, he reported 
the notable effect of potassium alumi- 
num on the guinea pig immune response to 
diphtheria toxin (7). Animals injected with 
the toxin in the presence of aluminum had 
a much more robust antigenic response 
compared with those exposed to the toxin 
alone. Additional “adjuvants,” consist- 
ing of heat-killed bacterial extracts, were 
found by others to exert similar immuno- 
genic effects. Later, this phenomenon was 
described by the preeminent immunolo- 
gist Charles Janeway as, “the immunolo- 
gist’s dirty little secret” (2). What Glenny 
had stumbled on, and Janeway brought to 


Department of Microbiome Science, Max Planck Institute 
for Biology, Tubingen, Germany. 
Email: sara.clasen@tuebingen.mpg.de 


the fore, was the critical role of the innate 
immune system in generating adaptive im- *‘ 
mune responses. To understand how these : 
responses develop requires deeper knowl- 
edge of how innate immune signaling is 
regulated, which we studied in the context ‘ 
of the microbiome. 

It is now broadly accepted that strong 
adaptive responses require activation of 
innate immunity, and that many adjuvants 
are ligands to innate immune receptors 
(3). The latter are frequently described as 
pattern recognition receptors because they 
evolved to bind highly conserved molecu- 
lar patterns expressed by pathogens. But 
here is another little secret: These motifs 
are also produced by trillions of microbes 
that colonize the human gut, the vast ma- 
jority of which are nonpathogenic and 
even beneficial to host health. How do in- 
nate immune receptors avoid reacting to 
these ligands? 

The Ley laboratory focuses on Toll-like 
receptor 5 (TLR5) and its ligand, flagellin, 
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because their interaction exemplifies this 
conundrum. TLR5 is expressed by epithe- 
lial and immune cells, and both pathogenic 
and nonpathogenic bacteria produce flagel- 
lin proteins to build filaments for locomo- 
tion. When these filaments break down, 
TLR5 binds flagellin and initiates a pro- 
inflammatory response. Although the inter- 
action between TLR5 and Salmonella FliC 
(the flagellin protein coded by Salmonella’s 
JfliC gene) is well studied, relatively little is 
known about how the receptor responds to 
flagellins from nonpathogens, or “commen- 
sals.” We analyzed both human gut meta- 
proteome and metatranscriptomes from 
individuals without known disease to iden- 
tify relevant, commensal-derived flagellins. 

At both the protein and transcript level, 
we found that flagellins in the human gut 
are made almost exclusively by members 
of the Lachnospiraceae family (4). This 
taxonomic group is prevalent in human 
microbiomes across the globe with well- 
established roles in promoting host health 
(5). We wondered whether their flagellins 
induce innate immune responses similar 
to those caused by FIiC. 

To address this question, we character- 
ized the interaction between TLR5 and 
40 commensal-derived flagellins that are 
abundant in human gut metagenomes (4). 
We developed a TLR5-flagellin binding 
assay to infer recognition and measured 
nuclear factor kB (NF-«B) signaling in re- 
porter cells to assess TLRS activity. Recog- 
nition correlates with activation for most 
of the candidates. But for ~25%, receptor 
recognition is decoupled from activation— 
the flagellins bind the receptor but fail to 
induce the pro-inflammatory response. 

We called this response “silent recogni- 
tion” by TLR5 and explored howthese “silent 
flagellins” are seen—but ignored—by in- 
nate immunity. The silent flagellin RAFlaB, 
produced by the common gut resident 
Roseburia hominis, showed the strongest 
binding to TLR5 in our assay, and we used it 
to further interrogate how the TLR5-silent 
flagellin interaction differs from that of 
TLR5-FIiC. Most silent flagellins are nearly 
identical to FliC in regions characterized as 
crucial for TLR5 activation (6). But when 
we mutated the primary interaction site in 
flagellin, we saw different effects on bind- 
ing to TLR5: The RAFlaB mutant showed 
a complete loss of binding, whereas the 
FliC mutant showed no change relative 
to the wild type (4). The resilience of the 
FliC mutant in binding TLR5 indicated the 
presence of a secondary, allosteric binding 
site in FliC that was missing from RAFlaB. 
We mapped its location then explored its 
effect by transplanting it into silent flagel- 
lins. As expected, the allosteric binding 
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site restored the RAFlaB mutant’s ability 
to bind TLR5. And unexpectedly, it made 
wild-type RAFlaB orders of magnitude bet- 
ter at activating TLR5. 

How does this binding site exert such 
a large impact on TLR5 signaling? For 
answers, we looked to the mechanism be- 
hind TLR5 activation. Flagellin is thought 
to induce receptor dimerization because 
TLR5 is a homodimer in its active state (7). 
Therefore, we explored whether the allo- 
steric binding site facilitates this process. 
The effort bore unexpected fruit: Although 
we did not detect FliC-induced dimeriza- 
tion, we did discover that a substantial 
fraction of TLR5 on the cell surface exists 
as a dimer in the absence of flagellin (4). 
The FIiC allosteric site mediates binding to 
these preformed dimers. Silent flagellins, 
consequently, cannot bind dimeric TLR5 


FINALIST 
Christopher J. Stewart 


Christopher Stewart 
received under- 
graduate degrees 
and a PhD from Nor- 
thumbria University. 
After completing his 
postdoctoral fellow- 
ship at Baylor College of Medicine, he 
started his laboratory in the Transla- 
tional and Clinical Research Institute 
at Newcastle University in 2018. His 
research is focused on microbial-host 
interaction in the gut of infants born 
prematurely, using multi-omic analy- 
sis of clinical samples in combina- 
tion with basic microbiology and an 
experimental organoid coculture sys- 
tem, to develop disease biomarkers 
and targeted therapeutic interven- 
tions to promote health. www.science. 
org/doi/10.1126/science.adi6318 
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Christoph A. Thaiss 


Christoph A. Thaiss 
received his under- 
graduate degrees 
from the University 
of Bonn, Yale Univer- 
sity, and ETH Zurich 
and a PhD from 

the Weizmann Institute of Science. 
After completing graduate school, he 
started his laboratory in the Micro- 
biology Department of the Perelman 
School of Medicine at the University 
of Pennsylvania in 2018. His research 
studies the multifaceted interactions 
between environmental factors, the 
gut microbiome, the immune system, 
metabolism, and the brain. www. 
science.org/doi/10.1126/science.adi6329 


unless they are equipped with the alloste- 
ric site from FliC. Thus, evasion of TLR5 
dimers enables silent flagellins to avoid ac- 
tivating the receptor. 

Our discovery of silent flagellins illus- 
trates one way by which innate immune 
receptors tolerate ligands from commen- 
sals. To contextualize these findings, we 
explored the prevalence of silent flagel- 
lins in the human gut. We screened ad- 
ditional flagellins from nearly a hundred 
commensal bacterial species to expand 
our list of silent flagellins, then measured 
their abundance in more than 1700 human 
gut metagenomes from around the world. 
Our analysis revealed that silent flagellins 
are ubiquitous. However, their levels are 
substantially lower in populations from 
industrialized regions (4). The factors be- 
hind this loss are unclear, as are the conse- 
quences for human health. 

What is increasingly apparent is that 
Lachnospiraceae flagellins can become the 
targets of antigenic responses, and these 
responses are overrepresented in disease 
states, including inflammatory bowel dis- 
ease (IBD) (8, 9). The antibodies elevated 
in IBD target flagellins that strongly stimu- 
late TLRS, as well as silent flagellins. How- 
ever, TLR5’s role in this process is not well 
understood. Notably, the prevalence of IBD 
is much higher in industrialized countries 
(10) and correlates with reduced silent fla- 
gellin abundance in these populations. 

Since Janeway’s remarks in 1989, many 
insights have been made into how innate 
immunity enhances adaptive responses. Al- 
though vaccine adjuvants have long shown 
how this relationship can be harnessed for 
public health, antigenic responses to com- 
mensal microbiota indicate that it can also 
be detrimental. Our continued work on the 
interaction between innate immune recep- 
tors and the microbiome—using methods 
from both computational and molecular 
biology (/1)— is critical to understand how 


these maladaptive responses develop and . 


to mitigate their effects. 
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degrees and 
a PhD from 
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University. After completing his 
postdoctoral fellowship at Baylor 
College of Medicine, he started 
his laboratory in the Transla- 
tional and Clinical Research 
Institute at Newcastle University 
in 2018. His research is focused 
on microbial-host interaction in 
the gut of infants born prema- 
turely, using multi-omic analysis 
of clinical samples in combina- 
tion with basic microbiology 
and an experimental organoid 
coculture system, to develop 
disease biomarkers and targeted 
therapeutic interventions to 
promote health. www.science.org/ 
doi/10.1126/science.adi6318 
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Diet-microbe-host interaction 


in early life 


Breast milk bioactives are important for infant 


microbiome and immunity 


By Christopher J. Stewart 


reast milk is a complex biofluid 

crafted over millions of years of 

evolution to provide optimal nutri- 

tion along with other bioactive com- 

pounds to the infant. Human milk 

oligosaccharides (HMOs) are com- 
plex sugars that represent the fourth most 
abundant component of breast milk, which 
makes them more numerous than proteins 
and other vital nutrients, yet they provide 
no direct nutritional benefit to the infant 
(1). Human evolution does not favor waste- 
ful processes, which suggests that HMOs 
play another crucial role in infant develop- 
ment. Recent work has borne this out and 
highlighted HMOs’ ability to act as prebi- 
otics. To improve human health through- 
out life by means of targeted intervention, 
it is paramount to better understand the 
three-way interaction between diet, mi- 
crobe, and host. 

The environmental determinants of dia- 
betes in the young (TEDDY) study collected 
over 12,000 longitudinal stool samples from 
more than 900 infants and found that re- 
ceipt of breast milk was the most important 
factor shaping the infant gut microbiome 
throughout the first year of life, after which 
breastfeeding seldom occurred (2). Taxo- 
nomic analysis showed that Bifidobacte- 
rium spp. were closely linked to receipt of 
breast milk, which followed on previous 
work that described the distinctive genetic 
capacity of Bifidobacterium species to uti- 
lize HMOs from breast milk (3). 

Such work has been performed in the 
context of infants born at full term, but a 
major research gap relates to diet-microbe- 
host interaction in infants born prema- 
turely. This important population has been 
the focus of my group’s recent work, which 
revealed that HMOs modulate the preterm 
infant gut microbiome, increase Bifidobac- 
terium colonization, and reduce disease 
risk. Further, probiotics provide the means 
to deliberately modulate the preterm gut 
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microbiome in a highly specific and repro- 
ducible way. 

Unlike infants born at full term, extremely 
preterm infants (i.e., <32 weeks’ gestation) 
have an immature intestinal architecture 
and an underdeveloped immune system. 
They are also less likely to be breastfed and 
receive limited exposure to microbes dur- 
ing the first months of life, which leads to a 
reduction in potentially beneficial bacteria. 
This reduction has been implicated in the 
pathogenesis of necrotizing enterocolitis 
(NEC), a serious inflammatory-mediated 
bowel disease that represents the leading 
cause of death in this population (4). Al- 
though the risk factors for NEC are com- 
plex, the most protective factor against 
disease for these infants is unequivocally 
receiving their mother’s own breast milk. 

Because the preterm gut can become 
leaky, translocation of microbes into the 
bloodstream and/or intestinal cell death 
represent major problems. However, evi- 
dence from my research and others has 
shown that certain types of bacteria, most 
notably HMoO-utilizing Bifidobacterium, 
may increase gut and immune maturation 
(5-8). Other work has also suggested a pos- 
sible role of specific HMOs in protection 
against NEC (9, 10). 

We recently expanded upon this research 
in our investigation of HMOs in moth- 
ers’ milk and longitudinal infant gut mi- 
crobiome samples in 33 NEC cases and 37 
matched controls. HMO profiling revealed . 
that a single HMO, disialyllacto-N-tetraose 
(DSLNT), was lower in breast milk received 
by infants who went on to develop NEC 
and could predict disease with high accu- 
racy (J1). We used univariate modeling to 
determine an optimal DSLNT threshold for 
disease risk and modeled how the develop- 
ing preterm gut microbiome was affected 
by receipt of DSLNT above or below that 
threshold. Infants that received milk with 
DSLNT below the threshold were associ- 
ated with slower microbiome maturation 
and reduced colonization by preterm gut 
community types (PGCTs) dominated by B7- 
fidobacterium species. Specifically, healthy 
babies who received adequate DSLNT 
showed quicker progression to potentially 
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beneficial PGCT-4 (dominated by B. longum 
and B. bifidum) and PGCT-5 (dominated by 
B. breve). 

Given these important links between 
HMOs in breast milk and PGCT coloniza- 
tion, we asked if different probiotic prod- 
ucts could be used to effectively tailor 
preterm gut microbiome development (12). 
We leveraged the Great North Neonatal 
Biobank, which has collected daily samples 
since 2010 to enable research into the pre- 
term gut microbiome and immune develop- 
ment. In this time, the neonatal intensive 
care unit (NICU) at Great North Children’s 
Hospital in Newcastle, UK, started using 
probiotics, initially Infloran in 2013 and 
then Labinic in 2016. Although the two pro- 
biotic products included the same species 
(B. bifidum and Lactobacillus acidophilus), 
with the addition of B. longum in Labinic, 


the strains for each product were distinct. 
This difference in strains led to consistent 
and important impacts on the preterm gut 
microbiome, such that infants that received 
Labinic quickly transitioned to PGCT-4 and 
infants receiving Infloran transitioned to 
PGCT-5. Infants in the NICU before probi- 
otics were introduced lacked colonization 
by Bifidobacterium and instead were domi- 
nated by pathobionts including Escherichia 
and Klebsiella (see the figure). 

Although associations derived from 
“omic” analysis of clinical samples provide 
important insights, it is not possible to 
determine causality or underlying mecha- 
nisms. We recently described a protocol for 
the generation of human intestinal organ- 
oids from preterm gut tissue (73) and en- 
gineered an intestinal organoid physoxic 
co-culture (IOPC) system that allows simul- 


taneous growth of anaerobic bacteria and 
intestinal organoids (14) (see the figure). We 
then used the IOPC system to demonstrate 
that preterm ileum-derived organoids have 
distinct gene expression when compared to 
adult ileum-derived organoids (/5). 

Using the preterm intestinal-derived 
organoid model revealed critical links be- 
tween PGCTs, namely, that PGCT-4 and 
PGCT-5 had distinct host epithelial tran- 
scription when compared to the other 
PGCTs and media-only control (72). Metabo- 
lomic analysis of stool samples also found 
that PGCT-4 and PGCT-5 were more com- 
parable functionally, despite differences in 
microbial taxonomy, and were distinct from 
the nonprobiotic PGCTs 1-3 (see the figure). 
Such research has important implications 
for the design of preterm infant probiotics 
and ultimately leads one step closer to per- 


Overview of preterm work spanning descriptive microbiome sequencing, functional metabolomic 
profiling, and mechanistic organoid coculture 


(Top) Timeline of probiotic use in preterm infants reveals a strain-specific impact on taxonomic composition. Large circles represent bacterial taxonomy, and small 
circles represent the functional metabolites in preterm infant stool. (Bottom) Schematic representation of the process of establishing, expanding, and experimenting 


with preterm intestinal-derived organoids. 
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sonalized medicine for gut health. 

Further work is ongoing to validate 
DSLNT as a biomarker for disease and to 
understand the potential mechanism to en- 
able targeted therapeutic intervention us- 
ing HMOs and HMO-utilizing probiotics. 
Better understanding of the diet-microbe- 
host interaction holds incredibly exciting 
possibilities to predict, diagnose, and ma- 
nipulate the microbiome of preterm infants 
to reduce disease risk. 
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A microbiome exercise 


Gut-brain connections drive the motivation to work out 


By Christoph A. Thaiss 


xercise is arguably the most potent 

and attainable lifestyle factor that 

individuals can use to safeguard 

themselves against a large variety of 

diseases, including metabolic, car- 

diovascular, neurodegenerative, and 
neoplastic diseases (J). Despite its vast 
health benefits, only a small fraction of the 
population adheres to the World Health 
Organization’s recommendation of 150 to 
300 minutes of moderate-intensity exer- 
cise per week (2). Busy schedules and rainy 
weather aside, what are the physiological 
bottlenecks that prevent us from engaging 
in regular physical activity? 

My research team at the University of 
Pennsylvania, spearheaded by PhD student 
Lenka Dohnalova, set out to answer this 
question using a large cohort of genetically 
diverse outbred mice. We observed marked 
interindividual variability in exercise capac- 
ity, with the top performers outrunning the 
weakest animals by more than 10-fold (3). We 
then carried out deep molecular and physi- 
ological profiling of these mice and used a 
machine-learning approach to nominate 
individual measurements that predict tread- 
mill and wheel running. To our surprise, ele- 
ments of the gut microbiome topped the list. 
We thus probed the functional importance 
of the microbiome in exercise performance 
and carried out ergometric measurements 
under gnotobiotic conditions (i.e., with de- 
fined microbial colonization). We noted that 
the absence of the microbiome in germ-free 
or antibiotics-treated mice diminishes vol- 
untary and endurance exercise capacity by 
~50%. We determined that this effect of the 
microbiota was driven by certain taxonomic 
elements, including members of the Lachno- 
spiraceae family. These experiments suggest 
that the gut microbiome may be an impor- 
tant contributor to the interindividual vari- 
ability in physical activity. 

The question of how the microbiome 
affects exercise performance led us to an 
intriguing discovery. Contrary to our ex- 
pectations, the effects of intestinal mi- 
crobes on physical performance were not 
mediated by tissues classically associated 
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with exercise, such as skeletal muscle. 
Rather, the microbiome appeared to fun- 
damentally influence the impact of exer- 
cise on the brain. Intense physical activity 
stimulates numerous neurochemical re- 
sponses in the brain, including a surge in 
dopamine release in the striatum (4). And 
interestingly, in the absence of the micro- 
biome, this surge was severely diminished. 

The release of striatal dopamine is 
known to cause feelings of pleasure, moti- 
vation, and reward (5)—all elements of the 
“runner’s high” phenomenon that some 
individuals experience after intense ex- 
ercise. On the basis of these findings, we 
speculated that the microbiome may en- 
hance exercise capacity through its ability 
to modulate exercise-induced dopamine 
responses. Through a series of pharmaco- 
logical and chemogenetic interventions, 
we found that dopamine deficiency was 
indeed the cause for reduced exercise in 
microbiome-depleted animals. Consis- 
tently, restoring dopamine signaling fully 
restored their running ability (3). 

How do microbes that reside in the gas- 
trointestinal tract control neurotransmit- 
ter levels in the brain? The communication 
between the microbiome and the brain can 
occur either through microbial molecules 
entering the systemic circulation or through 
direct neuronal connections between the gut 
and brain (6). We did not find evidence for a 
microbiome-derived systemic mediator that 
influences exercise capacity. By contrast, 
inhibiting gut-innervating sensory neurons 
recapitulated the effect of microbiome de- 
pletion on exercise, whereas stimulation of 
sensory afferents restored exercise capacity 
even in the absence of the microbiome. 

To determine the mechanisms by which 
the microbiome influences the activity of 
sensory neurons during exercise, we com- 
bined microbial genetics, metabolomics, 
and neuronal recordings. We identified a 
pathway whereby microbial production of 
fatty acid amides (FAAs) enhanced sensory 
neuron activity through the endocannabi- 
noid receptor CB1. This increase in neuronal 
activity, in turn, boosted striatal dopamine 
signaling during exercise (see the figure). 
Intriguingly, this pathway can be harnessed 
to modulate exercise performance through 
gastrointestinal interventions. For instance, 
colonizing germ-free mice with genetically 
engineered FAA-producing bacteria or feed- 
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A meta-organismal pathway regulates the motivation for exercise 
Microbiota-derived fatty acid amides are sensed by the endo-cannabinoid receptor CB1 on gut-innervating 
neurons, which enhances their activity during exercise. Sensory neuron activation, in turn, augments dopamine 
signaling in the striatum and thereby drives exercise performance. 


Microbiota 


ing antibiotics-treated mice an FAA-con- 
taining diet restored their exercise capacity. 
However, when we used a peripheral CB1 
inhibitor or a central dopamine receptor 
blocker, the positive effects of these inter- 
ventions were negated (3). 

Our finding has several important im- 
plications. It suggests that gastrointestinal 
signals can serve as a regulating factor on 
the activity of exercise-responsive neurons 
in the brain, potentially enhancing the 
generation of the “runner’s high” feeling. 
The evolutionary purpose of this gut-brain 
pathway is yet to be determined, but we 
speculate that it may link an animal’s de- 
cision to engage in prolonged physical 
activity to the nutritional status of the gas- 
trointestinal tract. 

The mental state plays a crucial role in 
exercise participation and performance, 
both in elite athletes and hobby runners 
(7). Our discovery of a microbiome-depen- 
dent pathway may contribute to a better 
understanding of the factors that motivate 
some individuals to reach their peak per- 
formance, whereas others prefer to observe 
from the sidelines. The malleability of the 
microbiome and its metabolites offers a 
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potential avenue to increase exercise en- 
gagement and its health benefits globally. 
Striatal dopamine is a critical mediator 
of motivated behavior beyond just exercise. 
Our discovery opens up the possibility of 
leveraging this pathway to influence other 
dopamine-dependent brain functions, 
such as learning, mood, and addiction. 
Further explorations into the vast universe 
of microbiome-derived molecules that in- 
fluence neuronal activity will provide valu- 
able insights into the potential of shaping 
the mind through the gastrointestinal tract. 
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BIOGEOGRAPHY 


When continents collide 


he movement of species across newly connected 


continents millions of years ago still shapes flora 
and fauna today. Skeels et al. showed that spe- 
cies’ dispersal ability, climate tolerances, and the 
climate in which they evolved help to explain why 
biotic exchanges are typically unequal, with more spe- 
cies spreading from one continent than the other. Using 
a model simulating species ranges and diversification 
paired with paleoenvironmental reconstructions, the 
authors found that precipitation tolerance influenced 
vertebrate species movements across Wallace's Line, 
which separates the distinct biota of Australia and 
New Guinea from that of Southeast Asia. Species 
that evolved in dry Australia were less able to cross 
to Asia, whereas the swath of tropical forest across 
the region allowed more species to move in the 
other direction through New Guinea. —BEL 


Science, adf7122, this issue p. 86 


The Sulawesi bear cuscus (Ailurops 
ursinus) is, in fact, a marsupial 
species endemic to the region around 
Wallace’s Line. 


BATTERIES 
Rapid ionic transport in 
inorganic solids 


Solid electrolytes have been 
developed to allow solid-state 
batteries to compete with liquid 
electrolyte batteries, but there 

is still a challenge in ensuring 
good contact with the electrodes, 
which thus limits the thickness 

of electrodes that can be used in 
practice. Li et al. applied the prin- 
ciples of high-entropy materials 
to improve upon the properties 
of existing electrolytes through 
the sensible partial substitution 
of addition elements. The authors 
demonstrate the incorporation 
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of their materials into a cathode 
structure, which provides the 
high ionic conductivity required 
for solid-state battery applica- 
tions. —MSL 

Science, add7138, this issue p. 50 


STRUCTURAL BIOLOGY 
Snapshots of ribosome 
dynamics in human cells 


Ribosomes are molecular 
machines that translate genetic 
information into proteins. Their 
active site is the target of many 
small molecules, including 
several antibiotics and the anti- 
cancer drug homoharringtonine. 


Although structural biologists 
have studied how small mol- 
ecules interfere with ribosome 
function in the test tube, Xing 
et al. now reveal structures of 
translating ribosomes within 
human cells using cryo—electron 
tomography. The authors visual- 
ized translation intermediates at 
high resolution and elucidated in 
detail how homoharringtonine 
acts on ribosomes. This work 
marks a technical advance in 
solving high-resolution struc- 
tures from inside human cells 
and adds to our understanding 
of this drug's effect on cells. 
—SMH 

Science, adh1411, this issue p. 40 
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ORGANIC CHEMISTRY 
Put a ring in it 


Traditionally, chemists make 
complex molecules by first 
assembling the central frame- 
work and then modifying the 
periphery. More recently, 
however, there has been an 
increased focus on late-stage 
framework editing by inserting or 
removing individual atoms. Wang 
et al. now report the insertion of 
a whole ring into a preexisting 
ring. Specifically, they relied on 

a photoredox catalyst to oxidize 
thiophene derivatives, activating 
them for the insertion of highly 
strained bicylobutane. 
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The bicyclic products feature an 
unusual eight-membered ring 
motif. —JSY 

Science, adh9737, this issue p. 75 


MOLECULAR BIOLOGY 
Kissing and kicking in 
transcription 
The spatial organization of chro- 
matin and specific factors within 
the nucleus regulates gene 
expression. To visualize nuclear 
organization at the molecular 
scale in developing embryos, 
Pownall et al. developed chro- 
matin expansion microscopy 
(ChromExM), which drastically 
improves the resolution of 
light microscopy by physically 
expanding embryos embed- 
ded ina series of swellable 
hydrogels without distorting 
chromatin organization (see the 
Perspective by Stasevich and 
Kimura). This procedure enabled 
the visualization of individual 
nucleosomes and revealed 
how the pioneer factor Nanog 
interacts with them. The authors 
also observed RNA polymerase 
|| nanostructures engaged in 
active transcription, and devel- 
oped a “kiss and kick” model of 
enhancer-promoter interactions 
in which transcription elongation 
leads to the physical separation 
of enhancers and promoters. 
—DJ 

Science, ade5308, this issue p. 92; 

see also adi8187, p. 26 


QUANTUM SIMULATION 
Watching fermions pair 


Ultracold atoms in optical lat- 
tices can be used to gain insight 
into the behavior of interact- 
ing quantum matter. Some of 
the most general physics that 
can be simulated is accessed 
through the Fermi-Hubbard 
model, which is relevant to 
many solid-state systems. 
Hartke et al. placed attractively 
interacting fermionic atoms of 
potassium into an optical lattice 
and used a quantum micro- 
scope to image the system and 
measure correlations between 
the lattice sites. Their results 
suggest the formation of nonlo- 
cal atomic pairs with a size that 
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depends on the strength of 
attraction. —JS 
Science, ade4245, this issue p. 82 


CELL CYCLE 
RhoGEF hitches a ride in 
dividing cells 
In cytokinesis, the final phase of 
cell division, the plasma mem- 
brane progressively pinches 
inward along the cell equator. 
Warecki and Tao investigated 
how the activation of the enzyme 
RhoA is precisely localized to 
enable this process. In a purified 
protein system, the activator 
of RhoA, RhoGEF, accumu- 
lated at microtubule ends due 
to a motor protein complex 
called centralspindlin. In cells, 
this accumulation of RhoGEF 
sustained RhoA activity and 
cytokinesis progression at the 
cell equator. The findings sup- 
port a long-held hypothesis for 
how cells locally activate RhoA 
during cytokinesis. —LKF 
Sci. Signal. (2023) 
10.1126/scisig.eadhO601 


HIV 


bNAbs for babies 


Broadly neutralizing antibodies 
(bNAbs) have shown promise 
as a companion or alternative 
to antiretroviral therapy (ART) 
for HIV-1, although their efficacy 
in the pediatric population 
remains unclear. Shapiro et al. 
report the results of a pro- 
spective clinical trial in which 
children on ART from birth in 
Botswana were treated with two 
HIV-1 bNAbs. The treatment was 
administered initially in combi- 
nation with ART and was then 
given alone. Eleven of 25 infants 
maintained viral suppression 
during the bNAb-only step, and 
the authors found that these 
infants had more favorable 
HIV-1 reservoir characteristics, 
including a smaller initial provi- 
ral reservoir and susceptibility 
of those proviruses to bDNAb 
neutralization. These results 
highlight the potential of DNAb 
treatment for infants and chil- 
dren living with HIV-1. —CSM 
Sci. Transl. Med. (2023) 
10.1126/scitransImed.adhO004 
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CONSERVATION 
The human predator 


Predators are formative compo- 
nents of ecosystems. Although 
modern humans tend not to 
think of ourselves as predators, 
ecologically, we are the preda- 
tor with the broadest and largest 
impact across species. Darimont 
et al. surveyed more than 45,000 
vertebrate species and estimated 
that over a third are “preyed” upon 
by humans through hunting and 
the pet trade, among other activi- 
ties. Removal of these species by 
humans may have ecosystem- 
level impacts, as 40% of harvested 
species are threatened by our 
exploitative consumption. —SNV 
Commun. Biol. (2023) 
10.1038/s42003-023-04940-w 


PREDATION 
Lesser fleas ad infinitum 


We understand strategic 
aspects of the binary lifestyles 


of bacteria chiefly from model 
species. However, bacteria fall 
prey to smaller bacteria, the 
growth and division of which may 
be regulated in different ways. 
Using single-cell methods, Santin 
et al. systematically explored the 
life cycle of the bacterial preda- 
tor Bdellovibrio bacteriovorus. 
This predator forms nonbinary 
replicative filaments within com- 
partments in bacterial prey cells, 
which generate variable numbers 
of daughter cells. Larger prey cells 
mean more offspring than smaller 
cells, but it is the nutritional 
quality of the prey (experimen- 
tally manipulated through the 
growth medium) that ultimately 
determines the growth rate of the 
predator. That the timing of key 
cellular processes scales with the 
size and quality of prey cells indi- 
cates that this tiny predator can 
adjust its life cycle to the variability 
of the prey it encounters. —CA 
Curr. Biol. (2023) 
10.1016/j.cub.2023.04.059 
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MICROBIAL ECOLOGY 
Leaf microbiome 
interactions 


Plant surfaces host complex 
microbial communities. Carbon 
source composition dictates 
whether microbes can live there, 
but species interactions make it 
difficult to predict which microbe 
combinations will thrive ona 
given substrate. Schafer et al. 
examined the carbon source 
preferences of 224 bacterial 
strains from plant leaves and 
used metabolic genome-scale 
models to simulate pairwise 
interactions. Most interactions 
were found to be negative, with 
bacteria growing less well in 
pairs than in isolation. Metabolic 
flux analyses indicated that 
occasional positive interactions 
may occur for versatile strains 
that use amino acids and organic 
acids more easily. This work 
may inform the construction of 
synthetic microbial communities 
according to substrate. -MRS 
Science, adf5121, this issue p. 42 


STRUCTURAL BIOLOGY 


Guide RNA editing 


In the parasitic protozoan 
Trypanosoma brucei, most 
mitochondrial genes are cor- 
rupted, and their transcripts 
must be repaired to restore 
coding capacity. Directed by 
guide RNAs (gRNAs), a structure 
called the editosome executes 
a cascade of messenger RNA 
(mRNA) cleavage, uridine 
insertions and deletions, and 
religation. Using cryo—electron 
microscopy and molecular 
approaches, Liu et al. visualized 
states of the substrate-binding 
complex responsible for gRNA 
stabilization and gRNA-mRNA 
duplex formation. This work 
provides a detailed framework 
for understanding mRNA editing 
by the editosome. —DJ 

Science, adg4725, this issue p. 43 
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STRUCTURAL BIOLOGY 
Making the human 
ribosome large subunit 


Human ribosome biogenesis is a 
highly regulated process involv- 
ing avast ensemble of assembly 
factors that orchestrate the 
formation of both ribosomal 
subunits. The mechanisms 
underlying the early assembly 
of the human large ribosomal 
subunit (pre-60S) remain 
elusive. To provide insights into 
human ribosome assembly, 
Vanden Broeck and Klinge used 
a combination of genome edit- 
ing, cryo—electron microscopy, 
and functional studies to reveal 
the structures of human pre-60S 
particles as they matured in the 
nucleolus and nucleus. This col- 
lection of structures highlights 
how assembly factors ensure 
the unidirectional maturation of 
human large ribosomal subunit 
precursors. —DJ 

Science, adh3892, this issue p. 44 


FOREST PHENOLOGY 
Opposite warming effects 
by season 


Global warming is changing 

the timing of photosynthesis, 
with leaves emerging earlier 

in spring in the temperate and 
boreal zones. A longer grow- 

ing season could mean greater 
carbon sequestration in forests, 
but the timing of leaves falling 
in autumn depends on multiple 
cues, making it difficult to pre- 
dict. Zohner et al. investigated 
how leaf senescence relates to 
day length, temperature, and 
early-season photosynthesis 
across northern forests using 
remote sensing, ground observa- 
tions, and experimental data. 
They found that warming had 
opposing effects on senescence 
dates depending on when it 
occurred: Warmer springs with 
higher photosynthesis corre- 
lated with earlier senescence, 
wheras warmer temperatures 
in autumn delayed senescence. 
Incorporating this shift in 
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knowledge may improve predic- 
tions of vegetation response to 
climate change. —BEL 

Science, adf5098, this issue p. 45 


PHYSICS 
Sizing up the electron’s 
dipole moment 


The puzzling imbalance between 
matter and antimatter in the 
universe can be explained by 
the breaking of charge parity 
symmetry. The standard model 
of particle physics does predict a 
slight breaking of this symmetry 
but is insufficient to explain the 
observations. Many extensions 
to the standard model have 
been proposed to resolve this 
discrepancy. To test such model 
extensions, tabletop experiments 
that measure the electron's 
electric dipole moment (eEDM), 
a measure of symmetry break- 
ing, have looked very promising. 
Roussy et al. exploited the large 
electric fields inside the polar 
molecular ions of hafnium 
fluoride to measure the eEDM to 
extremely high precision (see the 
Perspective by Fan and Jayich). 
The uncertainty of the measure- 
ment compares favorably to 
those achievable through acceler- 
ator-based experiments. —JS 
Science, adg4084 this issue p. 46; 
see also adi8499, p. 28 


PLANT SCIENCE 
Guard cell specification 


Many tissue-patterning processes 
involve asymmetric cell divisions 
that determine subsequent cell 
fate. Working with the Arabidopsis 
guard cell lineage, Muroyama et 
al. found that the polarly localized 
proteins BASL/BRXf generate a 
region where cell division planes 
cannot occur (see the Perspective 
by Goldy and Caillaud). BASL/ 
BRXf locally suppressed forma- 
tion of the preprophase band, 
which then formed elsewhere 

to appropriately place the cell 
division site. When the prepro- 
phase band was disrupted, 
neither guard cells nor epidermal 


pavement cells formed. Within the 
polarity domain, organizational 
dynamics of microtubule plus- 
ends were disrupted, depleting 
microtubules from the zone. This 
study uncovers a mechanism 
for plant tissue patterning that 
ensures the inheritance of cell 
fate determinants to a single 
daughter cell. -MRS 

Science, add6162, this issue p. 54; 

see also adi6664, p. 27 


SOLAR CELLS 
Improving perovskite/ 
silicon solar cells 


Two studies show how inter- 
faces between perovskite layers 
and silicon cells in tandem 
solar cells can be modified to 
improve performance (see the 
Perspective by De Wolf and 
Aydin). Mariotti et al. showed 
that an ionic liquid, piperazinium 
iodide, improved band alignment 
and enhanced charge extraction 
at the interface of a trihalide 
perovskite and a C electron- 
transporting layer by creating 
a positive dipole. With these 
modifications, a 2.0-volt open 
circuit voltage was achieved in 
asilicon tandem cell. Chin et al. 
report the uniform deposition 
of the perovskite top cell on 
the micropyramids of crystal- 
line silicon cells to achieve high 
photocurrents in tandem solar 
cells. Two different phosphonic 
acids improved the perovskite 
crystallization process and also 
minimized recombination losses. 
These modifications yielded 
perovskite/silicon tandem cells 
with certified power conversion 
efficiencies of more than 31% for 
active areas of at least 1 square 
centimeter. —PDS 
Science, adgO091, adf5872 
this issue p. 59, p. 63; 
see also adi6278, p. 30 


IMMUNE REGULATION 


Pinning down KIR3DL3 


Human killer immunoglobulin- 
like receptor (KIR) KIRSDL3 
has been characterized as a 
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conserved but polymorphic 
inhibitory KIR that can recog- 
nize the protein HHLA2, which 
is up-regulated in tumors, as 
a ligand. Palmer et al. used 
high-resolution flow cytometry 
and single-cell transcriptomics 
to characterize KIR3DL3- 
expressing cells. They detected 
enriched expression of KIR3DL3 
in y8 and CD8* T cells but not 
in natural killer cells. KIR3DL3- 
expressing cells were most 
frequently found in the lungs and 
digestive tract and were rare in 
peripheral blood and thymus. 
Blood KIR3DL3* T cells had an 
activated transitional memory 
phenotype and were consid- 
ered hypofunctional. KIR3DL3 
ligation could inhibit T cell recep- 
tor—mediated stimulation, and 
KIR3DL3 polymorphisms were 
most likely to reduce expression. 
KIR3DL3 expression appears 
to be up-regulated in parallel or 
downstream from T cell receptor 
stimulation, and individuals can 
have varied KIR3DL3 expres- 
sion that can affect immune 
responses. —CNF 
Sci. Immunol. (2023) 
10.1126/sciimmunol.ade5343 
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The bicyclic products feature an 
unusual eight-membered ring 
motif. —JSY 

Science, adh9737, this issue p. 75 


MOLECULAR BIOLOGY 
Kissing and kicking in 
transcription 


The spatial organization of chro- 
matin and specific factors within 
the nucleus regulates gene 
expression. To visualize nuclear 
organization at the molecular 
scale in developing embryos, 
Pownall et al. developed chro- 
matin expansion microscopy 
(ChromExM), which drastically 
improves the resolution of 
light microscopy by physically 
expanding embryos embed- 
ded ina series of swellable 
hydrogels without distorting 
chromatin organization (see the 
Perspective by Stasevich and 
Kimura). This procedure enabled 
the visualization of individual 
nucleosomes and revealed 
how the pioneer factor Nanog 
interacts with them. The authors 
also observed RNA polymerase 
|| nanostructures engaged in 
active transcription, and devel- 
oped a “kiss and kick” model of 
enhancer-promoter interactions 
in which transcription elongation 
leads to the physical separation 
of enhancers and promoters. 
—DJ 

Science, ade5308, this issue p. 92; 

see also adi8187, p. 26 


QUANTUM SIMULATION 
Watching fermions pair 


Ultracold atoms in optical lat- 
tices can be used to gain insight 
into the behavior of interact- 
ing quantum matter. Some of 
the most general physics that 
can be simulated is accessed 
through the Fermi-Hubbard 
model, which is relevant to 
many solid-state systems. 
Hartke et al. placed attractively 
interacting fermionic atoms of 
potassium into an optical lattice 
and used a quantum micro- 
scope to image the system and 
measure correlations between 
the lattice sites. Their results 
suggest the formation of nonlo- 
cal atomic pairs with a size that 


40 
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depends on the strength of 
attraction. —JS 
Science, ade4245, this issue p. 82 


CELL CYCLE 
RhoGEF hitches a ride in 
dividing cells 
In cytokinesis, the final phase of 
cell division, the plasma mem- 
brane progressively pinches 
inward along the cell equator. 
Warecki and Tao investigated 
how the activation of the enzyme 
RhoA is precisely localized to 
enable this process. In a purified 
protein system, the activator 
of RhoA, RhoGEF, accumu- 
lated at microtubule ends due 
to a motor protein complex 
called centralspindlin. In cells, 
this accumulation of RhoGEF 
sustained RhoA activity and 
cytokinesis progression at the 
cell equator. The findings sup- 
port a long-held hypothesis for 
how cells locally activate RhoA 
during cytokinesis. —LKF 
Sci. Signal. (2023) 
10.1126/scisig.eadhO601 


HIV 


bNAbs for babies 


Broadly neutralizing antibodies 
(bNAbs) have shown promise 
as a companion or alternative 
to antiretroviral therapy (ART) 
for HIV-1, although their efficacy 
in the pediatric population 
remains unclear. Shapiro et al. 
report the results of a pro- 
spective clinical trial in which 
children on ART from birth in 
Botswana were treated with two 
HIV-1 bNAbs. The treatment was 
administered initially in combi- 
nation with ART and was then 
given alone. Eleven of 25 infants 
maintained viral suppression 
during the bNAb-only step, and 
the authors found that these 
infants had more favorable 
HIV-1 reservoir characteristics, 
including a smaller initial provi- 
ral reservoir and susceptibility 
of those proviruses to bDNAb 
neutralization. These results 
highlight the potential of DNAb 
treatment for infants and chil- 
dren living with HIV-1. —CSM 
Sci. Transl. Med. (2023) 
10.1126/scitransImed.adhO004 


IN OTHER JOURNALS 


Edited by Caroline Ash 
and Jesse Smith 


CONSERVATION 
The human predator 


Predators are formative compo- 
nents of ecosystems. Although 
modern humans tend not to 
think of ourselves as predators, 
ecologically, we are the preda- 
tor with the broadest and largest 
impact across species. Darimont 
et al. surveyed more than 45,000 
vertebrate species and estimated 
that over a third are “preyed” upon 
by humans through hunting and 
the pet trade, among other activi- 
ties. Removal of these species by 
humans may have ecosystem- 
level impacts, as 40% of harvested 
species are threatened by our 
exploitative consumption. —SNV 
Commun. Biol. (2023) 
10.1038/s42003-023-04940-w 


PREDATION 
Lesser fleas ad infinitum 


We understand strategic 
aspects of the binary lifestyles 


of bacteria chiefly from model 
species. However, bacteria fall 
prey to smaller bacteria, the 
growth and division of which may 
be regulated in different ways. 
Using single-cell methods, Santin 
et al. systematically explored the 
life cycle of the bacterial preda- 
tor Bdellovibrio bacteriovorus. 
This predator forms nonbinary 
replicative filaments within com- 
partments in bacterial prey cells, 
which generate variable numbers 
of daughter cells. Larger prey cells 
mean more offspring than smaller 
cells, but it is the nutritional 
quality of the prey (experimen- 
tally manipulated through the 
growth medium) that ultimately 
determines the growth rate of the 
predator. That the timing of key 
cellular processes scales with the 
size and quality of prey cells indi- 
cates that this tiny predator can 
adjust its life cycle to the variability 
of the prey it encounters. —CA 
Curr. Biol. (2023) 
10.1016/j.cub.2023.04.059 
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SYNTHETIC BIOLOGY 


Moving toward motile 
artificial cells 


Ideally, synthetic cells need 

to be motile. For movement, 
artificial cells would need to 
be equipped with complex 
multiprotein machineries. 
Using the bacterial MinD 

and MinE proteins, Fu et al. 
achieved persistent directional 
movement of cell-mimicking 
liposomes on supported lipid 
bilayers. The Min proteins 
use energy from ATP to self- 
organize asymmetrically in the 
liposome membrane, leading to 
shape changes in the “cell” and 
mechanical force generation 
that promotes movement. This 
series of events reorganizes the 
proteins into a mechanochemi- 
cal feedback loop sufficient 

to drive continuous motion. 
—LBR 


Nat. Phys. (2023) 
10.1038/s41567-023-02058-8 
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AGRICULTURE 


Predicting yield in the field 


ost plant gene expression stud- 
ies are performed in controlled 
laboratory conditions. However, 
gene expression can be very dif- 
ferent when plants are grown in 
the constantly changing environments of 
fields. Additionally, expression patterns 
can vary even between genetically identi- 
cal individuals. De Meyer et al. sequenced 
transcripts from the leaves of field-grown 


METEORITICS 
Is the Solar System older 
than was thought? 


Calcium—aluminum-rich inclu- 
sions (CAIs) in meteorites are the 
oldest solids that formed in the 
Solar System. By convention, the 
age of CAls (4567.3 million years) 
is assumed to set the beginning 


oilseed rape (canola) plants (Brassica 
napus). Machine learning was applied to 
sequencing information to model phe- 
notypic characteristics such as branch 
numbers, seed weight, and shoot dry 
weight. For example, the authors found 
that transcriptional levels of flowering 
time genes measured in autumnal leaves 
correlated with crop yield the following 
year. Therefore, the transcriptomes of 
individual plants can be used to under- 
stand genes and processes influencing 
crop yield in field conditions. -MRS 


PLoS Comput. Biol. (2023) 
10.1371/journal.pcbi.1011161 


Rapeseed, a source of vegetable oil and protein, 
growing in fields in Germany 


of Solar System formation. Desch 
et al. reexamined this assumption 
using the aluminum—magnesium 
and lead-lead isotopic chronom- 
eters recorded in meteorites, 

and argue that the ages of CAIs 
were reset by transient heating. 
Extrapolating other meteorite 
data that were not reset indicates 
that the Solar System began to 


Revised ages for CAls found 
in meteorites such as Allende 
(pictured) show that 
they—and thus the Solar 
System—are older 
than previously 
thought. 


form 1.1 million years earlier than 
the apparent age of CAls. This is 
asmall correction compared with 
the age of the Solar System, but 
a large one when examining the 
relative timings of early formation 
events. —KTS 
Icarus (2023) 
10.1016/j.icarus.2023.115607 


ORGANOMETALLICS 
Sodium nucleophiles 
react swimmingly 


For decades, lithium has 
been the element of choice to 
stabilize negatively charged 
carbon in chemical reactions. 
Organosodium compounds, by 
contrast, have been plagued 
by solubility problems. Dilauro 
et al. report that the key all 
along was just to let the sodium 
compounds react in suspen- 
sions. Specifically, they treated 
organohalides in hexane with 
solid sodium to generate 
carbanions and then introduced 
these reagents into suspensions 
of electrophiles in water. The 
ensuing reactions proceeded 
cleanly and rapidly to produce 
coupling products, even at gram 
scale. —JSY 

Angew. Chem. Int. Ed. (2023) 

10.1002/anie.202304720 


PROTEIN THERAPEUTICS 


Drugs with a switch 
Controlling the activity of thera- 
peutics separately from their 
clearance from the body would 
add an additional layer of control 
and safety, especially for protein- + 
based drugs that activate 
immune responses. Marchand 
et al. inserted a previously 
developed OFF switch between 
the two domains of an anti- 

body and performed additional 
engineering to optimize the 
binding affinity for a competing 
ligand. Adding this competing 
molecule disrupted the protein 
complex and led to its degrada- 
tion. The authors show that this 
approach could also be used to 
generate switchable fusions with 
cytokines or other therapeutic 
proteins. —MAF 

ACS Chem. Biol. (2023) 
10.1021/acschembio.3c00012 
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Effect of climate warming on the timing of autumn 
leaf senescence reverses after the summer solstice 


Constantin M. Zohner*, Leila Mirzagholi, Susanne S. Renner, Lidong Mo, Dominic Rebindaine, 
Raymo Bucher, Daniel Palous, Yann Vitasse, Yongshuo H. Fu, Benjamin D. Stocker, Thomas W. Crowther 


INTRODUCTION: Ongoing climate change is 
causing rapid shifts in plant phenology, with 
far-reaching effects on the terrestrial carbon 
cycle and biodiversity. While advances in spring 
leaf-out dates in temperate and boreal forests 
are well documented, the effects on autumn 
leaf senescence are less clear. This is because 
leaf senescence is not only affected by tempera- 
ture but also by day length and vegetation 
activity early in the season in ways that are 
poorly understood. Accurately predicting future 
growing-season lengths and plant photo- 


Pre-solstice effect 


Slow development causes late 
senescence onset 


Leaf greenness 


Fast development causes early 
senescence onset 


Spring 


Solstice 


synthesis requires a better understanding of 
these interacting mechanisms at broad spa- 
tial scales. 


RATIONALE: Local observations and experiments 
suggest that early-season warming, causing 
earlier spring leaf-out and faster plant devel- 
opment, tends to advance autumn senescence 
dates. Conversely, late-season warming tends 
to delay autumn senescence. If true more gen- 
erally, then climate warming has opposing 
effects at the start and end of the growing 


Post-solstice effect 


Fast progression and early end 
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Slow progression and late end 
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Senescence 


Autumn phenological responses to pre-solstice and post-solstice climate warming. |n cold years, slow 
development before the summer solstice delays the onset of senescence, and cold autumn temperatures 
accelerate senescence progression. In warm years, fast development before the summer solstice 
advances senescence onset, and warm autumn temperatures slow senescence progression, delaying 


the end of senescence. 
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season, with a reversal of effects somew] Sine 
in between. To test the generality of the oppc.-—s 
effects of climate warming on leaf senescence 
in Northern Hemisphere forests, we used satel- 
lite, ground, and carbon flux data, as well as 


controlled experiments. 


RESULTS: Our results revealed that warming 
early and late in the growing season indeed 
has contrasting effects on leaf senescence, 
with a reversal occurring after the summer 
solstice. Across 84% of the northern forest 
area, we found that warmer temperatures 
and increased vegetation activity before the 
solstice advance the onset of senescence by 
1.9 + 0.1 days per °C, whereas warmer post- 
solstice temperatures slow the progression 
of senescence by 2.6 + 0.1 days per °C. Be- 
tween 1966 and 2015, the earlier onset of 
senescence has led to advances of 0.20 + 0.07 
days per year of the date at which autumn 
temperature starts to drive senescence pro- 
gression. By contrast, mid-senescence contin- 
ues to occur slightly later by 0.04 + 0.01 days 
per year, leading to a lengthening of the 
autumnal senescence period. 

In our experiments, warmer pre-solstice tem- 
peratures also led to earlier primary growth ‘ 
cessation (bud set), demonstrating that the 
impact of a warmer pre-solstice period ex- 
tends beyond leaf development and life span. 
This highlights the crucial role of overall * 
plant development and sink activity before 
the summer solstice in determining growing- 
season length. 


CONCLUSION: We have developed a unified 
explanatory framework for predicting autumn 
phenology, showing that leaf senescence now 
tends to begin earlier, because of increased 
pre-solstice vegetation activity, but progresses 
more slowly, such that the end of senescence < 
occurs later. The reversal in trees’ responsiveness ‘ 
to warming after the summer solstice likely is 
triggered by changes in day length and allows 
them to initiate the physiological processes of . 
leaf senescence and nutrient resorption in a 
fine-tuned balance between source and sink 
dynamics. Our results demonstrate the impact 
of developmental constraints (from cell and 
tissue growth) on autumn leaf senescence and 
forest productivity, affecting trends in growing- 
season length across the entire Northern 
Hemisphere. These insights provide a better 
understanding of the changes in growing sea- 
sons and carbon uptake of temperate and 
boreal forests under climate change. ® 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: constantin.zohner@gmail.com 
Cite this article as C. M. Zohner et al., Science 381, eadf5098 
(2023). DOI: 10.1126/science.adf5098 
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Effect of climate warming on the timing of autumn 
leaf senescence reverses after the summer solstice 


Constantin M. Zohner™*, Leila Mirzagholi+?, Susanne S. Renner®, Lidong Mo’, 
Dominic Rebindaine’, Raymo Bucher’, Daniel Palous"*, Yann Vitasse°, 
Yongshuo H. Fu®, Benjamin D. Stocker”®, Thomas W. Crowther! 


Climate change is shifting the growing seasons of plants, affecting species performance and biogeochemical 
cycles. Yet how the timing of autumn leaf senescence in Northern Hemisphere forests will change remains 
uncertain. Using satellite, ground, carbon flux, and experimental data, we show that early-season and 
late-season warming have opposite effects on leaf senescence, with a reversal occurring after the year’s 
longest day (the summer solstice). Across 84% of the northern forest area, increased temperature and 
vegetation activity before the solstice led to an earlier senescence onset of, on average, 1.9 + 0.1 days per °C, 
whereas warmer post-solstice temperatures extended senescence duration by 2.6 + 0.1 days per °C. The 
current trajectories toward an earlier onset and slowed progression of senescence affect Northern 
Hemisphere—wide trends in growing-season length and forest productivity. 


he phenological cycles of trees exert a 

strong control on the structure and func- 

tioning of ecosystems (J, 2) and global 

carbon, water, and nutrient cycles (3-5). 

Anthropogenic climate change has re- 
sulted in shifts in the growing seasons of tem- 
perate and boreal trees, with the start of the 
season today occurring, on average, two weeks 
earlier than it did during the 19th and 20th 
centuries (6) and the end of the season (EOS) 
being delayed (4, 7, 8). Each day of a longer 
growing season may increase net ecosystem 
carbon uptake by 3.0 to 9.8 gC m ° (4). Yet, 
owing to the complex and interacting effects 
of growing-season climate and the annual day- 
length cycle, the direction of EOS changes in 
response to climate change and the cascading 
effects on ecosystem productivity remain high- 
ly uncertain (9-13). 

Characterizing the interplay among the en- 
vironmental drivers of EOS at broad spatial 
scales is integral to improving our understand- 
ing of growing-season length and tree growth. 
Cell division, tissue formation, and growth in 
northern trees are highest at the beginning of 
the season and decline with shortening days 
(14-18), the adaptive reason being the limited 


time remaining for tissue maturation and bud 
set before the first frost (19). Local observa- 
tions and experiments have shown that early- 
season warming, causing earlier spring leaf-out 
and faster growth and tissue maturation, 
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1: Slow development / low temperature 
2: Fast development / high temperature 


tends to advance EOS dates (9, 11, 20, 21), 
whereas late-season warming has the opposite 
effect, delaying the EOS (22-24). Increased 
temperatures and physiological activity in the 
beginning of the season might drive earlier 
autumn senescence through a variety of pos- 
sible mechanisms, including developmental 
and nutrient constraints (9, 25, 26), seasonal 
buildup of water stress (27, 28), and radiation- 
induced leaf aging (29). In contrast, later in 
the season, a direct effect of temperature (cool- 
ing) is likely to drive the timing of autumn 
senescence (10, 24, 30). If these trends are 
correct, then climate warming has opposing 
effects at the start and end of the growing 
season, with a reversal of effects somewhere 
in between. 

In this study, we tested whether early- 
season temperature and vegetation activity 
drive an earlier EOS across temperate and 
boreal forests, with day length providing the 
linkage between seasonal activity, air warm- 
ing, and autumn phenology (Fig. 1). We ex- 
pected that early-season plant development 
and growth determine the onset of senescence 
(Fig. 1, scenario 1 versus 2), while autumn tem- 
perature affects senescence progression (to- 


ward full dormancy), with faster chlorophyll ‘ 


breakdown in colder autumns than in warmer 


A: Cold autumn 
B: Warm autumn 


Scenario 2 Scenario 1 Winter 


EOSio 


Fig. 1. Conceptual model of autumn phenological responses to pre-solstice and post-solstice growth 
and temperature (solstice-as-phenology-switch hypothesis). The onset of autumnal senescence was 
estimated in this study as the date when the greenness index last dropped by >10% of the seasonal 
maximum (EOSj9). In northern forests, stem growth, development rates, and photosynthetic capacity are 
highest before the summer solstice and decline with shortening days (14-16). Interannual variation in EOS; 
is a function of pre-solstice growth and development, with later EOSio in years with slow development 
and low temperature before the solstice (scenario 1) and earlier EOSy9 in years with fast development and 
high temperature (scenario 2). The progression of leaf senescence varies with autumn temperature, with 
faster chlorophyll breakdown in cold-autumn years (scenario A) than in warm-autumn years (scenario B). 
The dates of 50% chlorophyll loss (EOSs9) are therefore driven by pre- and post-solstice effects, whereas 
EOSio dates are mainly driven by pre-solstice effects. An earlier start of senescence in high-activity years 
(scenario 2) also predicts that trees will become sensitive to autumn cooling earlier than in low-activity 
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autumns (30, 31) (Fig. 1, scenario A versus B). 
Continued acceleration of early-season growth 
and development under climate warming 
(4, 5, 32, 33) might thus cause an ever earlier 
EOS onset, whereas the progression of senes- 
cence should be slowed down by warmer 
autumns (22, 23), lengthening the overall 
senescence period. This leads to four funda- 
mental predictions that were tested in this 
study: (i) Enhanced pre-solstice temperature 
and vegetation activity drives an earlier senes- 
cence onset (Fig. 1, scenario 1 versus 2). 
(ii) Growth and temperature effects on senes- 
cence dates reverse around the time of the 
summer solstice. (iii) Autumn temperature af- 
fects the speed of senescence, delaying its later 
stages (scenario A versus B), but has little ef- 
fect on its start. (iv) The date when trees be- 
come sensitive to autumn cooling (blue arrows 
in Fig. 1) has advanced over recent decades 
because of an earlier onset of senescence. 

To test these hypotheses, we combined 
phenology data from satellite observations 
across Northern Hemisphere temperate and 
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Fig. 2. Effect of temperature on the timing of autumn senescence in northern 
forests reverses after the summer solstice. Autumn phenology is represented 

as EOSi9 (A to C) or EOSs9 dates (D to F) from satellite observations (see Fig. 3A). 
(A and D) Relationship between mean monthly Tyay and EOS dates (means + 
95% confidence ranges) from multiple linear regression models. Percentages reflect 
the total positive and negative areas under the curve, meaning the relative advancing 
versus delaying effects of seasonal Tyay. (B and E) Effects of pre-solstice daytime 
temperature (Tgay pre; 20 March to solstice) and precipitation (Prcp pre), post- 
solstice daytime temperature (Tday post; solstice to mean pixel-level EOS date) and 
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boreal forests (34), ground observations from 
European (35) and American (36) deciduous 
trees, eddy covariance flux tower measure- 
ments (37), and controlled experiments on 
European beech (38). Seasonal vegetation 
activity was estimated using direct mea- 
surements of leaf-level gas exchange and 
ecosystem carbon fluxes (37), as well as 
photosynthesis models [satellite-derived 
gross primary productivity (GPP) (39) and 
the Lund-Potsdam-Jena General Ecosystem 
Simulator (LPJ-GUESS) model (9)]. We then 
ran linear models to test for the monthly and 
seasonal effects of photosynthesis, temper- 
ature, shortwave radiation, and water availa- 
bility on EOS dates. The satellite data allowed 
us to differentiate between the onset of senes- 
cence and its progression by analyzing the 
dates when greenness had dropped by 10% 
(EOS,o) or 50% (EOS;s0) relative to the seasonal 
maximum. The experiments allowed us to di- 
rectly test for seasonal variation in the effects 
of air temperature and radiation. Finally, we 
mapped the relative effects of early-season veg- 
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etation activity and late-season climate across 
Northern Hemisphere forests to test for pos- 
sible biogeographic patterns in the drivers of 
autumn senescence. 


Effect of temperature and vegetation activity 
on senescence dates reverses after the 
summer solstice 


Satellite-based phenology data (Figs. 2 and 
3 and figs. S1 to S3), European (Fig. 4) and 
American (fig. S4) ground observations, flux 
tower measurements (figs. S5 and S6), and 
experiments (fig. $7) all revealed a consistent 
advancing effect of pre-solstice (i.e., before 
21 June) temperature and productivity on EOS 
dates, which declined after the summer sol- 
stice. Thus, across 84% of the northern forest 
area [18% (Fig. 3C) and 22% (fig. SIC) sig- 
nificant at P < 0.05], increased pre-solstice 
daytime temperature (Tgay) and photosyn- 
thesis led to an earlier onset of senescence: 
each 1°C increase in pre-solstice temperature 
resulted, on average, in 1.9 + 0.1 (mean + 2 SE) 
days earlier EOS, (Fig. 3D). Similarly, each 
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precipitation (Prcp post), and atmospheric CO2 on EOS dates [(E) additionally 
includes the effect of autumn Taay]. (© and F) The univariate effects of 1-month-long 
Tyay intervals around the summer solstice [13 May to 11 June, 23 May to 21 June, 
2 June to 1 July, 12 June to 11 July, 22 June to 21 July, and 2 July to 31 July; the 
inset in (C) illustrates the time of year of each period, and the gray vertical line 
represents the summer solstice] on EOS dates. Analyses show relative effect sizes 
from pixel-level linear models. To examine relative effect sizes, both predictor and 
dependent variables were standardized by subtracting their mean and dividing by 
one standard deviation before analysis. 
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Fig. 3. Satellite observations reveal consistent advances in the onset of 
senescence (EOSjo) across northern forests in response to increased 
pre-solstice temperature. (A) Map showing the effects of pre-solstice 

(20 March to solstice) Tyay on EOS; timing at 0.25° resolution from linear 
models, including pre-solstice Tgay and post-solstice (solstice to mean pixel- 
level EOS date) Tyay as predictor variables. Red pixels indicate an earlier 
EOSio under increased pre-solstice Tgay, blue pixels indicate a delayed EOSjo. 
(B) Regression coefficient means and 95% confidence ranges summarized 
for each degree latitude (pre-solstice effects in red, post-solstice effects in 
black). (C) The distribution of the pre-solstice and post-solstice Tyay effects 


10% increase in pre-solstice photosynthesis 
resulted in 3.6 + 0.1 days earlier EOS)o (satel- 
lite data; fig. S1) or 3.7 + 1.2 days earlier PD»; 
(date when photosynthesis had dropped by 25% 
relative to the seasonal maximum according to 
flux tower data; fig. S6). A significant delay- 
ing effect of pre-solstice temperature and 
photosynthesis was found for <1% of northern 
forest pixels. 

Post-solstice temperature had a small effect 
on the onset of senescence (see Figs. 2, A and 
B, and 5A for EOS) dates, and fig. S8, A and B, 
for EOS.tart dates as an alternative measure of 
senescence onset) but prolonged its duration 
(number of days from EOS 9 to EOS¢;5, EOS; 
to EOS;o, or EOSs0 to EOSg,;) by 1.3 to 3.5 days 
per °C (fig. S9), explaining the large effect of 
autumn temperatures on EOSso dates (Fig. 2, 
D and E, and fig. S2, C to E, for satellite ob- 
servations and Figs. 4 and 5C for European 
plots). Precipitation and CO, levels had com- 
paratively small effects (Fig. 2, B and E, and 
fig. S10). 

The reversal of the effects of air temperature 
and vegetation activity on EOS dates after the 
summer solstice was consistent across (i) all 
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EOS metrics used here, that is, the onset of 
senescence (EOS 9 or EOSgtart3 Fig. 2, A to C, 
and fig. S8) and mid-senescence (EOS;9; Fig. 2, 
D to F); (ii) continents and forest types (fig. 
S11); and ii) a set of alternative variables 
linked to growing-season activity and devel- 
opment, namely Tyay (Fig. 3 and fig. S2) and 
photosynthetic activity (figs. S1 and $3). Along 
the full latitudinal gradient (30°N to 65°N) 
studied here, the period during which vegeta- 
tion productivity had an advancing effect on 
EOS,, dates consistently ended after the sol- 
stice, at ~26 June (fig. S12). The effect reversal 
after the summer solstice was further sup- 
ported by an analysis that used 10-day moving 
steps around the solstice (Fig. 2, C and F). To 
further test for the importance of separating 
pre- and post-solstice variables for EOS pre- 
dictions, we ran leave-one-out cross-validation 
of models that included (i) only pre-solstice, 
Gi) only post-solstice, or (iii) both types of var- 
iables as predictors of EOS dates (38). In full 
agreement with our hypothesis (Fig. 1), the 
onset of senescence (EOS,9) was better ex- 
plained by pre-solstice than by post-solstice 
variables, and the full model—including both 
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2015 


across all pixels. Mean pre- and post-solstice Tgay regression coefficients and 
the percentage of pixels with an advancing pre-solstice Tgay effect or delaying ‘ 
post-solstice Tuay effect shown as red and black text, respectively 
(percentage of significant pixels at P < 0.05 in brackets). (D) Partial effect of 
pre-solstice Tgay on EOSio dates, controlling for year (fixed effect) and site 
(random effect). (E) Temporal trends in EOSj) anomalies (means + 95% 
confidence ranges) from mixed-effects models with year as a single fixed 
effect (red line and equation) or including pre-solstice Tgay as an additional 
fixed effect (black line and equation). Pixels were treated as grouping 
variables of random intercepts. 


pre-and post-solstice variables—showed only 
slightly better performance than the pre- 
solstice model (fig. S13, A and D). Because 
autumn temperature modulates the progres- 
sion of senescence, EOS, dates were slightly 
better explained by post-solstice than by pre- 
solstice variables, with the combination of both 
variables yielding the best predictions (fig. $13, 
B, C, E, and F). 

In line with the satellite observations, high 
pre-solstice temperature and productivity cor- 
related with advanced EOS;, dates in the 
European and American plot data, across all 
species (Fig. 4B and fig. S4) and across a set of 
alternative variables [Tyay (Fig. 4) and LPJ- 
GUESS model-derived photosynthesis (fig. 
$15)]. On the basis of these findings, we ran 
multivariate mixed models for the European 
plot data, including pre-solstice and post- 
solstice (solstice to mean EOS;,) temperature 
or photosynthesis and precipitation, CO, levels, 
and autumn nighttime temperature (autumn 
Tnignt) to determine their relative importance. 
Pre-solstice temperature (or photosynthesis) 
had the strongest advancing effect on EOSs9 
dates, whereas autumn T,ient had the strongest 
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Fig. 4. The seasonal effects of temperature on interannual variation in 
mid-senescence (EOSs9 dates) from European long-term observations 
(PEP725 data). (A) Effects of monthly (January to October) Tuay on autumn 
senescence dates. Percentages reflect the total positive and negative areas 
under the curve, meaning the relative advancing versus delaying effects of 
temperature. (B) Species-level results. (€) The relative effects of pre-solstice 
(20 March to solstice) and post-solstice (solstice to mean EOSs9) temperature, 
pre-solstice and post-solstice precipitation, atmospheric COz, and autumn 
nighttime temperature (Autumn Thight). (D) Effects of 1- 
intervals around the summer solstice (13 May to 11 June, 23 May to 21 June, 


2 June to 1 July, 12 June to 11 July, 22 June to 21 July, 


see inset), including the respective temperature interval as a single fixed effect. 


delaying effect on EOS; dates, with both ef- 
fects being more than three times greater than 
that of precipitation and atmospheric CO, 
(Fig. 4C). EOS predictions showed that the 
full model representing both pre- and post- 
solstice effects captured within-site EOS;, trends 
in response to mean annual temperature (ad- 
vance of 0.4: + 0.1 days per °C increase in mean 
annual temperature; fig. S16). In contrast, a 
model representing only post-solstice tem- 
perature and precipitation predicted delays 
of 0.7 + 0.1 days per °C, whereas a pre-solstice 
model predicted advances of 0.9 + 0.1 days 
per °C. These findings demonstrate that in- 
formation on both pre- and post-solstice cli- 
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mate is necessary to reproduce the observed 
EOSs9 responses to rising temperature. 


Ongoing trends toward an earlier start, slowed 
progression, and later end of senescence 


Our finding that the onset of senescence is 
driven by pre-solstice vegetation activity and 
development, while the speed of its progres- 
sion depends on autumn temperature (fig. $9), 
suggests that global warming leads to an 
earlier start and slowed progression of senes- 
cence (scenario 2B in Fig. 1). Indeed, across 
all analyzed northern forest pixels, the onset of 
senescence (EOS,9 date) has advanced by an 
average of -0.4 + 0.1 days per decade between 
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temperature from 20 March to 22 May (Mar—May), 
ar-Sol), 20 March to 21 July (Mar-Jul), 20 March to 


20 August (Mar—-Aug), 20 March to mean EOSs9 (Mar-Off), 22 May to mean 
EOSs0 (May-Off), solstice to mean EOSs9 (Sol-Off), and 21 July to mean 

ng the respective temperature interval as fixed effect. 
(F) Moving-window analysis, showing the “reversal” dates when the temperature 
effect switches from negative to positive for each 20-year time period from 


monthly correlations; see (A) to (C)]. On average, the 


models, including time series and species [(A) and (C) 


ies [(B) and (F)] as random effects, with both predictor 
and dependent variables standardized. R*, coefficient of determination. 


2001 and 2018 (Fig. 3E), in parallel with 
increased pre-solstice vegetation productivity 
(fig. S14, A and B), with the strongest 
advances in EOS, dates found for regions with 
the largest increase in pre-solstice productivity 
(fig. S14K). When removing the effect of pre- 
solstice temperature or photosynthesis on 
senescence onset by including either variable 
as a fixed effect in addition to year, the model 
instead predicted delays in EOS,, dates of 0.8 
to 14 days per decade (Fig. 3E and fig. S1E), 
underscoring how strongly temperature-driven 
pre-solstice vegetation activity affects the start 
of senescence. The warmer autumns (fig. S14, I 
and J) are instead slowing the progression of 


4 of 8 


4 


by 0.7 days per year. Analyses show effect size means + 


RESEARCH | RESEARCH ARTICLE 


A EOS 1p (Satellite) 

xs) 

2 

o 

xo) 

® 

N 

£ 

w 

To 

Cc 

a 

7) 

B EOSzp (Satellite) 

xs) 

2 

D 0.5 

_e) 

g 

5 0.0 

w 

ze 

§ -05 

7) 

Cc EOSs (PEP725 data) 

xs) 

£ 

5 0.5 

xe) 

N 

5 0.0 

w 

ze 

§ -05 

7) 
CF CFS © S&S 

Pe € o Ss # & i 

Aes S VL gs 


Fig. 5. The effects of temperature, radiation, 
spring leaf-out dates, and precipitation on inter- 
annual variation in the timing of EOSio and 
EOSso. Panels show results for satellite-derived 
EOSio dates (A), satellite-derived EOSs9 dates 

(B), and ground-sourced EOSso dates (C). All linear 
models include mean Tyay and shortwave radiation 
(SWrad) and the sums of precipitation (prep) 

from 20 March to 21 June (pre-solstice) and from 
22 June to the mean EOS date of each time series 
(post-solstice), as well as spring leaf-out dates, 

as predictor variables. Models were run at the pixel 
level [(A) and (B)] or individual level (C), and 
boxplots show the spread of effect sizes. All 
variables were standardized to allow for direct effect 
size comparison. 


senescence, leading to delays in EOS; dates 
(fig. S2G), especially in regions with the largest 
increases in autumn temperature (Fig. S14M). 

The offsetting effect of pre-solstice vegeta- 
tion activity on autumn warming-induced 
delays in EOS; is also apparent in regional 
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trends over the past 70 years (time series 
and species as random effects; fig. $17). On 
average, European EOS;, dates have been 
delayed by only +0.35 + 0.02 days per decade 
(fig. S17B). By contrast, when keeping pre- 
solstice productivity constant by including it 
as a fixed effect in addition to year, the model 
predicts a delay of +0.81 + 0.03 days per decade 
(fig. S17D), showing that the increase in pre- 
solstice vegetation productivity has offset up 
to ~60% of the delay in EOS;, that would 
have occurred had pre-solstice productivity 
not increased. This explains why EOS;, delays 
have contributed only ~15% (2.4 + 0.2 days) to 
the 16.7 + 0.4 day-long extension of the grow- 
ing season that has occurred over the past 
70 years (fig. S17, A and B). 

The longer senescence duration (fig. S9) and 
delayed EOS;9 dates (when greenness has 
dropped by 50%) under warmer autumns re- 
veal how autumn temperature modulates 
senescence (Figs. 4C and 5, B and C, and fig. 
S18, C and D). However, if increased pre- 
solstice vegetation development (fig. S14A) 
is the main driver of an earlier onset of EOS, 
one should find an ever earlier susceptibility 
of trees to autumn cooling. We tested this by 
using temporal moving-window analyses on 
the European long-term observations and 
found that the effect reversal dates, at which 
increased temperature and productivity start 
to be associated with delayed EOS;, dates, 
have indeed advanced, by an average of -0.7 
to -1.0 days per year (Fig. 4F and fig. SI5F or 
fig. SI9B for a shorter moving window). This 
is also reflected in the moving windows of 
monthly effect sizes, which show that July 
photosynthesis has become more strongly as- 
sociated with delayed EOS, dates over recent 
decades (fig. S19C). As an alternative method 
of determining when autumn cooling starts 
driving senescence progression, we modeled 
the autumn period best explaining EOS» dates 
and found that it has advanced by -0.20 + 
0.07 days per year for the 1966-2015 period 
(fig. S20A) or by -0.43 + 0.09 days per year for 
the 1981-2015 period (fig. S20B). This earlier 
start of the period when trees react to autumn 
cooling provides further evidence for an earlier 
onset of senescence in response to increased 
early-season development. 

An important consideration of our autumn 
senescence model presented in Fig. 1 is that 
the effect reversal date—the compensatory point 
of the advancing and delaying effects—should 
be flexible despite day length likely trigger- 
ing the initial decline in the advancing effect 
after the summer solstice (14, 15, 17). This is be- 
cause the actual reversal date of the antagonistic 
effects, meaning the period when trees become 
sensitive to cooling in late summer, is a func- 
tion of warming and development before the 
summer solstice, which will vary (Fig. 1). In- 
volvement of day length and a flexible effect 


reversal date are therefore not mutually exclu- 
sive ideas but rather necessary components of 
the same model. 


Factors governing the link between 
pre-solstice temperature and senescence onset 


Previous research on European deciduous trees 
has suggested a negative feedback between 
growing-season productivity and autumn phe- 
nology, with increased productivity driving 
earlier senescence (9). The data analyzed here 
now reveal that the productivity before the 
summer solstice is indeed linked to earlier 
senescence dates and that this holds across 
the entire Northern Hemisphere temperate 
and boreal forest biome, implying a wide- 
spread constraint on future growing-season 
extensions in response to global warming. To 
disentangle the environmental drivers of this 
feedback, we ran multiple linear regression 
models that included air temperature, solar _ 
radiation, water availability, and spring leaf- 
out dates as predictor variables, all of which 
have been shown to affect leaf senescence 
dates (9, 11, 20, 26, 28, 40) (Fig. 5 and figs. S21 
or S22 using soil moisture instead of precip- 
itation to represent water availability). Across 
the majority of pixels or sites, pre-solstice tem- 
perature was the main driver of EOS, dates 
(Fig. 5A), whereas EOS;, dates were driven by 
both pre- and post-solstice temperatures, with 
opposite effect directions (Fig. 5, B and C). This 
suggests that temperature-driven development 
and growth—rather than radiation-induced 
leaf aging, drought, or leaf-out per se (i.e., 
constrained leaf life span)—are driving the 
advancing effect of early-season vegetation ac- 
tivity on autumn phenology. 

To further isolate the mechanisms driving 
the discovered reversal of the effects of global 
warming around the summer solstice, we con- 
ducted an experiment on a dominant European 
tree species (Fagus sylvatica), in which we 
cooled (chamber temperature set to 10°C dur- 
ing the day and 5°C at night) and shaded 
(~84% light reduction) saplings during dif- 
ferent times of the season. Pre-solstice tem- 
perature again had a strong advancing effect 
on autumn phenology, with cooling of trees 
in June causing a delay in EOS;) and EOSs9 
dates of +16.5 + 6.6 days and +10.2 + 2.5 days 
(mean + SE), respectively. Cooling in July had 
no effect, and August cooling tended to ad- 
vance EOS dates (figs. S7A and S23A), in full 
agreement with the global-scale remote sens- 
ing data and the ground observations. The ef- 
fect of shading was small before the summer 
solstice and most pronounced during July—the 
month with the highest mean daily radiation 
and temperature—with EOS; delayed by +6.5 
+ 2.8 days under shade conditions. Radiation 
effects thus followed a different seasonal pat- 
tern than temperature, supporting a direct ef- 
fect of radiation on leaf aging (26, 29). Summer 
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photosynthesis was equally reduced in both the 
shade and the cooling treatments by 52 to 72% 
compared with the control (fig. S24). That pre- 
solstice temperature, but not pre-solstice light 
availability, affected EOS dates provides further 
support for the idea that accelerated develop- 
ment and growth under warmer temperatures, 
rather than photosynthesis- or radiation- 
induced leaf aging, are the main drivers of 
the pre-solstice effects on senescence dates. 
Yet given that both plant developmental speed 
(tissue expansion and meristematic activ- 
ity) and photosynthesis early in the season 
are strongly driven by temperature (47), both 
processes are linked, and pre-solstice GPP 
therefore appears to be a reliable proxy of 
this effect (fig. S1). 

An important future avenue will be to dis- 
entangle the relative roles of developmental 
processes within leaves (42) versus growth 
processes in the rest of the plant (25, 47) in 
driving earlier senescence under increased 
pre-solstice temperatures. That pre-solstice 
temperatures and productivity advanced senes- 
cence even in species with continuous leaf pro- 
duction throughout the season, such as Betula 
pendula (Fig. 4B and fig. SI5B), suggests that 
leaf development and longevity cannot be the 
sole drivers, implying involvement of growth 
(sink) processes in other organs, including 
wood, roots, and buds. This idea was further 
supported by a second experiment in which 
we tested the response of primary growth ces- 
sation (bud set) to pre- and post-solstice 
warming. The results showed that pre-solstice 
warming (22 May to 21 June) advanced bud 
set in F. sylvatica by 4.4 + 1.8 days, whereas 
post-solstice warming (22 June to 21 July) de- 
layed bud set by 4.9 + 1.8 days (fig. S25), con- 
sistent with an effect reversal after the summer 
solstice. The negative effect of warmer pre- 
solstice temperatures on growing-season dura- 
tion is therefore not limited to leaf development 
and life span (23, 42, 43), suggesting an im- 
portant role of overall plant growth and sink 
activity before the solstice in determining 
autumn growth cessation and leaf senescence 
(14, 16, 17, 41). 

The observed EOS advances under elevated 
pre-solstice temperature and growth may ad- 
ditionally be linked to availability of soil re- 
sources throughout the growing season. Optimal 
growth conditions in spring (high tempera- 
tures and sufficient precipitation) may deplete 
nitrogen availability to trees, which in turn may 
drive early bud set and senescence (25, 26, 44). 
This effect may be further enhanced by in- 
creased C:N ratios of leaves under rising atmo- 
spheric CO., causing microbial immobilization 
and reduced nitrogen accessibility (45, 46). 


Conclusions 


Our investigation of the seasonal drivers of 
autumn phenology shows a consistent reversal 
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after the summer solstice in the effects of cli- 
mate warming on leaf senescence (i) across 
large biogeographic ranges with varying pre- 
solstice growth (satellite data); (ii) in multiple, 
broad-ranged tree species with different spring 
phenologies (ground data); (iii) in eddy covar- 
iance measurements; and (iv) under controlled 
experimental conditions. These findings reveal 
important constraints on growing-season length 
and photosynthetic activity, whereby earlier 
and enhanced activity before the summer sol- 
stice drives earlier photosynthetic declines, 
primary growth cessation, and senescence in 
autumn. The seasonal control on EOS by air 
temperature and vegetation activity may be 
mediated by the annual day-length cycle (15, 17) 
and nutrient availability (26, 44). Year-to-year 
differences in the onset and progression of 
autumn senescence thus emerge as the result 
of a complex synchronization between trees’ 
developmental states, seasonal variation in the 
circadian rhythm, and climate fluctuations. 
This mediation by the annual day-length cycle 
provides a unifying framework to explain pre- 
vious results, in which the magnitude and di- 
rection of climate warming effects on autumn 
phenology varied (9, 12, 22, 23, 33, 47-50), 
largely because studies did not disentangle 
pre- and post-solstice variables. It now is clear 
that warmer temperatures and increased veg- 
etation activity before the summer solstice 
drive an earlier onset of senescence, whereas, 
in agreement with previous studies (12, 47, 57), 
warmer temperatures after the solstice slow 
down the progression of senescence, predict- 
ing that senescence will continue to start 
earlier but progress more slowly. 

The reversal in how trees respond to tem- 
perature during the summer likely evolved as 
an adaptation to seasonal climates with harsh 
winters because it allows plants to reliably 
anticipate the progression of the seasons and 
prepare for winter dormancy long before the 
temperature starts dropping (19). The solstice 
switch in trees’ physiological responsiveness to 
temperature calibrates their seasonal rhythms 
and mediates how they react to warm or cool 
temperatures, now and in the future. This en- 
ables trees to initiate tissue maturation and 
the physiological processes of leaf senescence 
and nutrient resorption (75) in a fine-tuned 
balance between source and sink dynamics. 
Improved models of plant development and 
growth under climate change will need to in- 
corporate the reversal of warming effects after 
the summer solstice. 


Methods summary 
Satellite observations 


Start-of-season (SOS) and EOS dates for North- 
ern Hemisphere forests from 2001 to 2018 
came from the MODIS Global Vegetation 
Phenology product (34). SOS was defined as 
the date when satellite-derived greenness had 


reached 15% of its annual maximum. EOS was 
defined as the date when greenness had de- 
creased by 10% (EOSj9), 50% (EOSs9), or 85% 
(EOSg;), representing the start of senescence, 
mid-greendown, and dormancy, respectively. 
As an alternative measure of the start of senes- 
cence, we used the VIIRS/NPP Land Cover Dy- 
namics product (52), which uses a breakpoint 
algorithm to define the onset of greendown. 
Information on climate came from (53), and 
GPP came from the MODIS GPP product (39). 
We included GPP, Taay, Tnignt, Shortwave 
radiation, CO, levels, precipitation, soil mois- 
ture, and SOS dates as covariates in our 
analyses. 

To investigate the seasonal effects of tem- 
perature, photosynthesis, shortwave radiation, 
and water availability on the timing of EOS;, 
dates, we aggregated data for different periods 
before and after the summer solstice. For each 
time series, we also determined the optimal 
autumn interval for which temperature ex- 
plains most of the variation in EOS;, dates. 
We quantified the importance of each covar- 
iate in driving interannual variation in EOS 
dates with pixel-level linear models (Figs. 2 
and 3). To approximate the end date of the 
period during which early-season vegetation 
activity had an advancing effect on the onset 
of senescence, we tested the correlation be- 
tween different GPP periods (always begin- 
ning at the SOS date) and EOS,, dates (fig. 
$12). To test for decadal-scale temporal trends 
in EOS dates, we ran mixed-effects models in- 
cluding only year or year and pre-solstice tem- 
perature (Fig. 3, D and E) or GPP (fig. S1, D and 
E) as fixed effects and treating pixels as group- 
ing variables of random intercepts. 


Ground observations 


Ground data on 396,411 EOS dates of four 
dominant tree species in central Europe be- 
tween 1951 and 2015 came from the Pan Eu- 
ropean Phenology Project (35). Climate data 
for the same period were obtained from the 
Global Land Data Assimilation System (53). 
The EOS; corresponded to the date when 50% 
of leaves had lost their green color. We in- 
cluded photosynthesis, Taay, Thignt, Shortwave 
radiation, CO, levels, precipitation, soil mois- 
ture, and spring leaf-out dates as covariates in 
our analyses. Daily photosynthesis was derived 
from the LPJ-GUESS photosynthesis model 
(64). As for the satellite observations, data were 
aggregated for monthly and longer periods 
before and after the summer solstice, and we 
also determined the optimal autumn interval 
for which temperature explains most of the 
variation in EOSs, dates. 

The importance of these variables in driving 
EOS dates was then evaluated with linear 
mixed models, including time series (unique 
site and species combination) and species ran- 
dom effects. Our final model included fixed 
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effects of pre- and post-solstice Tgay (Fig. 4C) or 
photosynthesis (fig. S15C), as these variables 
emerged as the strongest drivers of EOS dates. 

To test whether the relative effects of var- 
iables have been changing over the past de- 
cades, we ran the mixed models separately for 
each 20-year (or 15-year) time period from 
1966 to 2015 (or 1980 to 2015). To estimate the 
date at which the effect of temperature and 
photosynthesis reverses, we conducted moving- 
window analyses of the monthly temperature 
and photosynthesis effects. Additionally, we 
estimated the day at which autumn temper- 
ature starts driving senescence progression 
by calculating the autumn period for which 
temperature best explained variation in EOS;, 
dates for each moving window (fig. S20). We 
additionally analyzed direct observations of 
EOSso dates for 10 North American tree spe- 
cies from 1991 to 2019 from the Phenology of 
Woody Species at Harvard Forest dataset (36) 
(fig. S4). 


Flux tower measurements 


Eddy covariance data from 10 temperate forest 
(broadleaf deciduous/mixed) sites with >9 years 
of data were gathered from the FLUXNET2015: 
CC-BY-4.0 dataset (37). To represent the onset 
dates of photosynthesis declines (PD) in late 
summer, we extracted the day of year on 
which GPP last dropped below 10% (PDjo; 
fig. S5) or 25% (PD,;; fig. S6) of maximum an- 
nual GPP. We then tested for the relationships 
between monthly (April to September) GPP 
estimates and PD, or PD»; dates by running 
mixed-effects models, including site as a ran- 
dom effect. 


Experiments 


In the first experiment, we tested for the ef- 
fects of seasonal variation in temperature and 
light on EOS dates. We exposed 3-year-old 
F. sylvatica trees to cooling or shading con- 
ditions during four 1-month-long periods (from 
May to August). In the cooling treatments, trees 
experienced a nighttime temperature of 5°C 
and a daytime temperature of 10°C. In the 
shading treatments, trees were exposed to 
an ~84% reduction in incoming radiation. In- 
dividual leaf senescence dates were defined as 
the day of year when chlorophyll content last 
dropped by 10% (EOS,,) or 50% (EOS; 9) of the 
maximum chlorophyll content in summer. 
Leaf net photosynthesis was measured with a 
portable infrared gas analyzer (LI-6800, LI-COR 
Biosciences, Lincoln, NE). To test for differences 
in leaf senescence dates among treatments, we 
ran multivariate linear models including tem- 
perature and shade treatments as categorical 
variables. 

In the second experiment, we tested the ef- 
fects of pre- and post-solstice warming on bud 
set dates by exposing 4-year-old F. sylvatica 
trees to cooling conditions (8°C during day 
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and night) during 1-month-long periods before 
and after the solstice. Bud set was defined as 
the date when the buds of an individual had 
reached >90% of their final length (23), indi- 
cating aboveground primary growth cessation 
(23, 55). To test for differences in bud set dates 
and autumn bud growth rates among treat- 
ments, we ran mixed-effects models including 
treatment as categorical fixed effect and bud 
types (apical versus lateral) as a random effect. 
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INTRODUCTION: Microbiome composition is 
tied to host health and ecosystem function. 
However, the processes that determine spe- 
cies abundances in an environmental context 
remain poorly understood. Given that micro- 
biomes are bounded by resource availability, 
insights into the metabolic capacities of their 
constituent organisms present a key avenue 
for predicting interspecies interactions and 
broader microbiome assembly rules. 

Plants constitute the largest biomass on 
Earth and are colonized by a variety of micro- 
organisms that affect their health and growth. 
The composition of plant microbiomes has 
been found to be largely deterministic, sug- 
gesting the presence of defined drivers of 
community assembly. However, the metabolic 
mechanisms of the interactions underlying 
these patterns remain poorly understood. 


RATIONALE: To assess the degree to which pat- 
terns of resource allocation contribute to micro- 
bial interactions, we first profiled the carbon 
source-utilization capabilities of 224 represen- 
tative bacterial strains isolated from leaves of 
wild Arabidopsis thaliana plants. To do this, 
we cultivated each isolate on 45 different carbon 
sources and developed a computational tool 
to score the growth of each strain in an auto- 


Carbon utilization profiles of leaf 
bacterial strains 


45 carbon sources 


224 strains * ¢ La 
0 —* 
Growth No growth 


Metabolic niche overlap 
and genome-scale models 


mated way. This screen allowed us to calculate 
the degree of metabolic niche overlap between 
the strains, which informed predictions of re- 
source competition on the plant host. We further 
leveraged this experimental data to generate 
a collection of 224 genome-scale metabolic mod- 
els, which encompass the metabolic networks of 
each organism. We used these models to predict 
the outcomes of interspecies interactions on 
the plant host and to obtain mechanistic in- 
sight into specific patterns of resource use and 
exchange between these bacteria. 


RESULTS: We found that the carbon source- 
utilization profiles of the strains exhibited a 
strong phylogenetic signature, on the basis of 
both the experimental screen and the path- 
ways represented in the genome-scale models. 
We validated the performance of the models 
and of predictions based on niche overlap by 
conducting competition experiments with two 
sets of strains on the Arabidopsis host. These 
experiments revealed a dominance of negative 
interaction outcomes (i.e., a strain reached a 
lower overall population level upon cocoloni- 
zation compared with monoassociation), which 
was in agreement with predictions of high 
interstrain niche overlap. The genome-scale 
models provided an additional degree of 


interaction outcomes 


Interaction outcome of strainA 


Predicted pairwise and community 


ath strain or community B 


NegatiVC Et _»»«—_<€ Positive 


predicting instances of positive outco.-2 


observed in planta and further underscoring 
the importance of carbon metabolism to com- 
munity assembly. After this experimental 
validation, we modeled interaction outcomes 
for more than 17,500 pairs of strains. We 
predicted that in 94% of pairings, at least one 
strain would experience a reduction in abun- 
dance compared with monoassociation. We 
then analyzed the metabolic fluxes underlying 
these predicted outcomes, which revealed a 
high prevalence of competition for sugars. 
This competition could be offset by increased 
uptake of amino and organic acids in our 
simulations, which points to the metabolic 
mechanisms underpinning the positive inter- 
action outcomes that we observed in planta. 


CONCLUSION: Our results indicate a major role 
for carbon source availability and metabolic 
interactions in community assembly in the 
plant host, which, together with the trait con- 
servation observed here, likely contribute to 
the overall deterministic outcomes of plant 
microbiome assembly. In addition to recapi- 
tulating interspecies interactions in situ, our 
modeling framework represents a powerful 
tool for determining the metabolic mecha- 
nisms that lead to the emergence of specific 
ecological patterns. This knowledge will ulti- 
mately enable targeted microbiome design, 
which is pivotal to microbiome applications 
in health, agriculture, and the environment. 
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Modeling of metabolic interactions predicts ecological outcomes for leaf microbiota members in situ. We mapped the resource-use capabilities of 224 

representative bacterial strains of the A. thaliana leaf microbiome by testing their ability to grow on 45 different carbon sources. These data allowed us to predict 
interaction outcomes between these strains using genome-scale models and metrics of niche overlap. Experimental validation in planta underscored the importance 
of resource competition and suggested metabolic mechanisms underlying specific interaction outcomes. 
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Resource allocation affects the structure of microbiomes, including those associated with living 
hosts. Understanding the degree to which this dependency determines interspecies interactions may 
advance efforts to control host-microbiome relationships. We combined synthetic community 
experiments with computational models to predict interaction outcomes between plant-associated 
bacteria. We mapped the metabolic capabilities of 224 leaf isolates from Arabidopsis thaliana by 
assessing the growth of each strain on 45 environmentally relevant carbon sources in vitro. We used 
these data to build curated genome-scale metabolic models for all strains, which we combined 

to simulate >17,500 interactions. The models recapitulated outcomes observed in planta with >89% 
accuracy, highlighting the role of carbon utilization and the contributions of niche partitioning and 


cross-feeding in the assembly of leaf microbiomes. 


arious hosts, including animals (J, 2), hu- 
mans (3, 4), and plants (5), support mi- 
crobial communities with hundreds to 
thousands of different species. Recent 
studies have revealed that a determi- 
nistic relationship exists between environmental 
composition and community structure, even for 
complex microbiomes (6-0). Thus, a thorough 
understanding of the resources available to a 
community could enable prediction of its spe- 
cies and functional composition. Not only would 
the ability to make such predictions help us bet- 
ter understand the relationship between envi- 
ronment and phenotype, but it would also 
provide an accessible way to rationally design 
synthetic communities with defined functions. 
Despite this potential, we still lack a com- 
prehensive understanding of the information 
necessary to predict whether a specific orga- 
nism will survive or be outcompeted in a parti- 
cular environmental context. Resolving this 
question remains an active area of research, 
with studies that use interaction outcomes 
between pairs of organisms to predict overall 
community behavior representing a particu- 
larly attractive approach (11-13). Nonetheless, 
conducting the necessary mapping of pos- 
sible interactions between all organisms re- 
mains experimentally challenging even for 
relatively simple communities and largely out 
of reach for complex host-associated micro- 
biomes in situ. 
Given these limitations, one may ask which 
ecological concepts can be used to predict the 
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role of individual organisms within a commu- 
nity context. Microbial populations are fun- 
damentally bounded by resource availability, 
where limiting amounts of micronutrients 
such as carbon, nitrogen, and phosphorus act 
to constrain community size and lead to the 
emergence of competition between organisms 
(14-16). Although the prevalence of these com- 
petitive interactions has begun to be quanti- 
fied in a variety of ecosystems (72, 17-20), 
the general rules by which they influence 
community-wide assembly patterns remain 
unresolved. As such, a commonly used ap- 
proach is to infer the competitive potential 
between organisms from the degree of metab- 
olic niche overlap between them, which relies 
solely on knowing their individual resource- 
utilization capabilities (27, 22). Although a 
resource-by-resource understanding of the 
metabolic profiles of complex microbiomes 
can be challenging to obtain, it represents a 
more accessible approach for inferring inter- 
species interactions and has proven success- 
ful at predicting high-level patterns of species 
diversity in communities (10, 23-26). 

Despite its benefits, the predictive power of 
niche overlap is limited in that it cannot 
account for the emergence of positive ecolo- 
gical interactions between organisms either 
through resource partitioning, cross-feeding, 
orthogonal resource-acquisition strategies, or 
evolved cooperation (27-30). As a complementary 
approach, genome-scale metabolic models pres- 
ent a tractable way to integrate the effects of 
these additional mechanisms into predictions 
of community structure (31, 32). Genome-scale 
models are mathematical representations of 
the metabolic capabilities of individual organ- 
isms. They incorporate genes, reactions, and 
metabolites associated with a given organism’s 


metabolic network and are applied to quanti- 
tatively assess how organisms can use available 
resources for growth (32). In addition to gener- 
ating predictions of resource allocation for 
individual organisms, combinations of genome- 
scale models of different organisms have been 
increasingly used to mechanistically describe 
pairwise and community-wide dynamics (33-35). 
Combining genome-scale models with pre- 
dictions of niche overlap therefore presents a 
powerful way to predict interspecies interac- 
tions within microbiomes. 

In this study, we generated metrics of metab- 
olic niche overlap and a collection of genome- 
scale models to predict interaction outcomes 
between 224 bacterial members of the Arabidopsis 
thaliana leaf microbiome. This environment, 
known as the phyllosphere, is an oligotrophic 
habitat where competitive interactions between 
its resident microbes are prevalent (19, 36, 37). 
Because it contains a variety of different car- 
bon sources (38, 39), it is an ecosystem that is 
well suited for studying the effects of resource 
allocation on interspecies interactions and 
community structure. In particular, the exposed 
nature of the phyllosphere makes it a relatively 
accessible setting for study in controlled con- 
ditions. The microbiota of the phyllosphere 
has been shown to confer beneficial functions 
to the plant host (40-42), which, together with 
the scale of colonizable surfaces presented by 
leaves (37, 43, 44) and the economic impor- 
tance of crops (45, 46), underscores the value 
of studying the processes underlying its assem- 
bly (47, 48). 


Results 
Profiling the carbon source-utilization 
capabilities of phyllosphere bacteria 


Comprehensive collections of environmental 
strains are valuable resources for studying 
relevant host-microbe and microbe-microbe 
interactions (42, 49-52). To examine interaction 
outcomes among members of the Arabidopsis 
phyllosphere (Fig. 1), we used the A+ LSPHERE: 
a collection of 224 strains isolated from leaves 
of wild A. thaliana plants that represent a cross 
section of the taxonomic and functional diver- 
sity of the phyllosphere microbiota (53). We 
first assessed the ability of each strain to grow 
on solidified minimal medium supplemented 
with individual carbon sources from a set of 45 
(table S1). These carbon sources, comprising 
a range of sugars, organic acids, sugar alco- 
hols, one-carbon compounds, aromatic com- 
pounds, and amino acids, were selected on the 
basis of the high prevalence of resources such 
as glucose, sucrose, and some amino acids that 
are known to account for a substantial frac- 
tion of carbon available to leaf epiphytes 
(36, 38, 39, 54). These carbon sources were 
also selected in part on the basis of the known 
ability of certain leaf strains to metabolize 
aromatic compounds, cleave glycosidic bonds, 
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Fig. 1. Overview of experimentally and computationally guided prediction and testing of metabolic interactions among bacterial members of the 


Arabidopsis phyllosphere microbiota. 


and/or utilize one-carbon compounds (37, 55). 
In addition to these carbon sources, we sup- 
plemented the media with vitamins to en- 
able the growth of auxotrophs (the set of 
carbon sources included amino acids, so we 
did not supply additional amino acids to the 
media). We used a plate assay to evaluate the 
growth of a given strain on a particular carbon 
source, which was compared with a carbon- 
free control, using both manual inspection 
and automated image processing (Materials 
and methods). 

Our in vitro metabolic screen revealed phy- 
logenetically contingent patterns of carbon- 
utilization capabilities across the At-LSPHERE 
collection (Fig. 2 and fig. S1). We found that 
strains belonging to the same genus displayed 
similar—or even identical—carbon source- 
utilization profiles, suggesting trait conserva- 
tion between closely related species (Fig. 2A). 
We also identified genus-specific signatures of 
resource utilization, as well as cases in which 
strains could not grow on any of the tested 
carbon sources (e.g., Chryseobacterium and 
Exiguobacterium). These latter strains most 
likely have nutritional requirements that are 
not met by the minimal medium we provided 
(e.g., amino acid auxotrophies), because they 
were able to produce visible colonies on a 
complex medium (R-2A+M, Materials and 
methods). By contrast, strains belonging to 
the genera Arthrobacter, Pseudomonas, and 
Rhizobium grew on a large number of carbon 
sources [27.5 + 6.9, 27.1 + 3.7, and 25.8 + 3.6, 
respectively (mean + SD)]. To quantify these 
differences, we assigned a metric of substrate 
versatility, V, to each strain, defined as the 
percentage of all 44 growth-yielding carbon 
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sources that each organism was able to use 
for growth (for this analysis, methylamine 
was excluded because it did not support the 
growth of any strain). The most versatile 
strain was Arthrobacter sp. Leafl45 (V = 77%), 
followed by Pseudomonas spp. Leafl5 and 
Leaf98, and Rhizobium sp. Leaf202 (V = 73, 
70, and 68%, respectively). When comparing 
the phylogenetic distribution of strain-specific 
substrate versatilities, we observed that Beta- 
and Gammaproteobacteria had above-average 
versatilities (Fig. 2B), whereas Actinobacte- 
ria (with the exception of Arthrobacter) and 
Bacteroidetes had below-average versatili- 
ties. Furthermore, our screen revealed the 
presence of a canonical niche occupied by 
methylotrophs that was distinct from and 
smaller than that of many other strains (Fig. 
2A and fig. S2). Indeed, the lower average 
versatility of Alpha- and Betaproteobacteria— 
which include Methylobacterium and Methyl- 
ophilus spp., respectively—relative to Gam- 
maproteobacteria reflects the low versatility 
of methylotrophs. 

Our screen also revealed the varying degrees 
to which each resource promotes growth (Fig. 
2C). To quantify these patterns, we assigned a 
measure of substrate fertility, F’, to each tested 
carbon source, defined as the percentage of 
strains that was able to grow on that particular 
carbon source. The most fertile or commonly 
used carbon source was glucose (F = 81% of 
strains), followed by succinic acid (F = 80%) 
and glutamate (F = 78%) (Fig. 2C and table S2). 
Among the most rarely used carbon sources 
were aromatic compounds, including trypto- 
phan (F = 9%) and coniferyl alcohol (F = 8%), 
a common component of lignin. 


Given the largely orthogonal metabolic 
niche occupied by methylotrophs relative to 
all other strains, we repeated our calculations 
of substrate fertility without methylotrophs to 
more clearly detect patterns of fertility among 
the other substrates. This analysis revealed that 
mono- and disaccharides build a core set of 
carbon sources commonly used by the large 
majority (F = 75%) of tested strains and that 
the most fertile substrate, glucose, was con- 
sumed by 95% of the strains. The next most 
common group of carbon sources (= 50%) 
included organic acids and amino acids that 
feed into the tricarboxylic acid (TCA) cycle 
(fig. $3). 

These experimentally determined carbon 
source-utilization capabilities allowed us to 
calculate the degree of niche overlap among 
our strains. We defined a niche overlap index 
(NOI) for each focal strain, A, with respect to 
any other strain, B, as the set of usable carbon 
sources shared by the focal strain and the 
competitor divided by the total number of car- 
bon sources used by the focal strain (23, 56): 

NOI, p = Nsom A0B 
N C Sources, A 

We calculated NOI values for all pairs of 
strains, setting a threshold of 75% as likely 
prone to competitive exclusion (57) (Fig. 2D). 
In accordance with the similar carbon source- 
utilization profiles, we found high niche over- 
lap within most genera, suggesting frequent 
intragenus competitive interactions. Addition- 
ally, we found that methylotrophic strains 
form two interconnected interaction spaces: 
one comprising Methylobacterium and the other 
comprising Methylophilus strains, underscoring 
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their degree of metabolic specialization com- 
pared with all other strains. Moreover, we 
found that members of the Microbacteriaceae 
had high niche overlap with a high number of 
other taxa, largely owing to their low substrate 
versatilities. We also observed the opposite 
case: Rhizobium strains had a low NOI with 
other taxa, whereas essentially all strains (ex- 
cept Methylobacteria) had a high NOI with 
Rhizobium strains. 


An in silico representation of phyllosphere 
bacterial communities 


Having screened the carbon source-utilization 
capabilities of the At+LSPHERE collection, we 
sought to model how these bacteria would 
interact metabolically in a leaf environment. 
To do this, we adopted a stoichiometric metab- 
olic modeling approach because it allowed us 
to explicitly simulate strain- and substrate- 
specific patterns of resource utilization, con- 
version, and exchange in a multitude of 
defined environmental conditions. We thus 
began by creating draft genome-scale metab- 
olic reconstructions (58) for each member of 
the At-LSPHERE. This process yielded a 
separate metabolic network for each of the 
224 At-LSPHERE strains, containing all the 
metabolic reactions predicted to be contained 
by each organism on the basis of its genome 
annotation. After generating these draft metab- 
olic reconstructions, we used flux balance 
analysis (37) to simulate the growth of each 
strain individually within a complete medium 
containing all 45 carbon sources used in our 
in vitro screen. We found that whereas some 
of our reconstructions were predicted to pro- 
duce biomass when using this complete me- 
dium, the majority could not do so when only 
individual carbon sources were provided (Fig. 
3A). Although this inability to grow was at 
odds with our experimental results, it was not 
unexpected because models generated solely 
from genomic information are often missing 
key metabolic pathways and thus have limited 
predictive power without additional experi- 
mental curation (33, 59-61). We therefore used 
the results of our carbon source screen to fill 
the apparent gaps in each metabolic reconstruc- 
tion by using the tool NICEgame (62), which 
suggests biologically relevant alternative path- 
ways to account for missing reactions by lever- 
aging information on reaction thermodynamics. 
We followed this step with additional manual 
curation (Materials and methods, table S3, 
and fig. S4) and then compared each of the 
resulting 224 models with their correspond- 
ing strains on the basis of their ability (or in- 
ability) to use the 45 carbon sources selected 
for our in vitro screen (fig. $5). This process 
resulted in each genome-scale model having 
a final balanced accuracy of 0.98 + 0.07 (Fig. 
3A) (mean precision 1, mean recall 0.96), with 
214 models (96%) having a balanced accuracy 
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of at least 0.85, and 189 models (84%) being 
100% accurate. 

Our curation process yielded a collection 
of 224 genome-scale models that revealed 
further metabolic capabilities and physiologi- 
cal characteristics of the At-LSPHERE collec- 
tion (Fig. 3). The models contained between 
1568 (Flavobacterium sp. Leaf359) and 3004 
(Pseudomonas sp. Leaf15) reactions (Fig. 3B), 
which moderately correlated with the genome 
size of their corresponding strain [coefficient 
of determination (R’) = 0.54, fig. S6A]. Reac- 
tions involved in the biosynthesis of glycero- 
phospholipids, key components of the cell 
membranes of Gram-negative bacteria, were 
enriched in members of the Proteobacteria 
within our collection (Fig. 3C). We also found 
Firmicutes and Actinobacteria to contain 
more reactions related to degradation of plant 
polysaccharides (e.g., melibiose, raffinose, and 
sucrose) relative to other taxonomic classes 
and identified an enrichment for benzoate 
degradation capabilities as previously reported 
in Betaproteobacteria (63, 64). The high degree 
of similarity to our strains’ known carbon 
source-utilization capabilities was supported 
by a clustering analysis of the reactions con- 
tained in each model (Fig. 3D). In this analysis, 
Methylobacteriaceae spp. and Methylophilaceae 
spp. formed clusters distinct from most other 
models, and Acidovoraz strains clustered to- 
gether with the majority of Pseudomonas and 
Rhizobium organisms. These clusters largely 
mirrored those generated from our in vitro 
data alone (Fig. 3E), which underscored the 
phylogenetic contingency of resource-utilization 
patterns and may underlie the deterministic 
assembly of microbiomes in general. With this 
in silico representation of the At+LSPHERE col- 
lection, we proceeded to use our genome-scale 
models to simulate the ecological outcomes of 
mixed-culture experiments. 


Computational models recapitulate ecological 
patterns in planta 


We assessed the predictive power of the 
genome-scale models and the NOI against 
experimental data in two in planta experi- 
ments (Fig. 4, A and B). In a first experiment, 
we inoculated A. thaliana seedlings with pair- 
wise combinations of seven bacterial strains, 
or with a community consisting of all seven 
strains (referred to as SynCom7), and com- 
pared the colonization level of each strain in 
combination with the colonization density 
achieved in monoassociation (Fig. 4B). To 
represent a range of metabolic capabilities, 
we selected strains across a gradient of sub- 
strate versatility values and hence a gradient 
of NOI. For the first experiment, we selected 
representative strains from the highly versatile 
Arthrobacter spp., Pseudomonas spp., and 
Rhizobium spp. (Leafl45, V = 77%; Leafl5, 
V = 73%; and Leaf202, V = 68%; respec- 


tively), along with four additional strains with 
intermediate-to-low versatility values: Rhodo- 
coccus sp. Leaf233, Sphingomonas spp. Leaf34 
and Leaf257, and Microbacterium sp. Leaf179 
(V = 55, 36, 45, and 41%, respectively). On 
the basis of NOI, we expected the strains 
with lower versatilities to more frequently de- 
crease in abundance upon cocolonization with 
other strains, either in pairwise combinations 
or in a community context (Fig. 4C). How- 
ever, all other strain pairings had low NOI 
values (<75%), limiting the degree to which 
we could infer interaction outcomes for these 
combinations. 

We used the genome-scale models corres- 
ponding to these seven strains to simulate 
their growth within an environment reflecting 
the carbon source availability of the Arabidopsis 
phyllosphere (Materials and methods). To 
obtain an indicator of the ecological outcomes 
of these pairings, we compared the predicted 
growth of each strain on its own with that in 
coassociation with another: A strain was 
deemed to have experienced a negative inter- 
action outcome if its biomass production rate 
in coassociation was lower than that on its 
own, and a positive outcome if its biomass 
production rate was higher in coassociation. 
These simulations predicted an approximately 
even distribution of positive and negative 
outcomes for all seven organisms (Fig. 4C). 
However, when we considered the magnitude 
of the changes in growth experienced by each 
strain, we found that predictions skewed more 
strongly negative than positive. This observa- 
tion is consistent with previous descriptions of 
the prevalence of competitive interactions in 
the phyllosphere (18, 19), which suggests the 
commonality of exclusion of one strain by 
another. Indeed, when we compared the growth 
of each strain on its own with that in the 
presence of all six other strains, we predicted 
almost exclusively negative outcomes. Despite 
this overall distribution, we found that the 
number of positive or negative outcomes dif- 
fered depending on the strain in question, with 
Microbacterium sp. Leafl79 and Sphingomonas 
sp. Leaf257 almost always experiencing a re- 
duction in growth when paired with another 
strain. By contrast, Rhizobium sp. Leaf202, 
Pseudomonas sp. Leaf15, and Arthrobacter 
sp. Leafl45 were most often predicted to 
benefit from coinoculation with another of 
our selected strains. These predicted positive 
outcomes corresponded to instances of low 
NOI values (Fig. 4C), suggesting that these 
strains could metabolically evade competition 
by a partner strain and may additionally bene- 
fit from changes in the ecosystem (e.g., cross- 
feeding) induced by the partner strain. 

The interaction outcomes of these seven 
strains in planta were largely congruent with 
those predicted by NOI and metabolic model- 
ing. Of the outcomes that showed a fold change 
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Fig. 3. Overview of collection of metabolic models for 224 bacterial 
members of the A. thaliana leaf microbiota. (A) Distributions of balanced 
accuracy as tested on 45 carbon sources for models generated solely from 
genomic information (top) and after curation (bottom). (B) Attributes of models 
as clustered by taxonomic phylum and class. Numbers bordering phylogenetic 
tree denote strain identity (e.g., 220 corresponds to Xylophilus sp. Leaf220). The 
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ring indicates the size of the genome used to generate each model (53), and the 


greater than 2 or less than 0.5, we found that 
a majority were negative, confirming the dom- 
inance of competition between these strains 
(Fig. 4D). In the SynCom7 condition, a sig- 
nificant abundance reduction was observed 
for all strains except for Rhizobium sp. Leaf202, 
matching our predictions and supporting the 
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presence of increased competitive pressure in a 
community context. Our experimental data 
also highlighted the predictive power of NOI 
data alone for predicting interaction outcome 
directionality, with 93% of instances of nega- 
tive outcomes corresponding to an NOI >0.75. 
Predictions generated by the genome-scale 
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third ring represents the versatility, V, of each model. The 

outermost bars represent the number of reactions contained in each model 
(minimum = 1568, maximum = 3004). (C) Clustered heatmap of 60 most 
highly represented reaction types, scaled by the number of models 
corresponding to each phylum and class. (D and E) Principal coordinates 
analysis (PCo) of strains as determined by (D) reactions in genome-scale 
models and (E) in vitro carbon source screen, with select genera 


models yielded an increased degree of gra- 
nularity, recapitulating the significant inter- 
action directionalities that we observed in 
planta with a balanced accuracy of 89% (tables 
S4 and S5). These predictions additionally 
captured the weaker directionalities of border 
cases that did not meet our experimental 
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Fig. 4. Model predictions and in planta validation of interactions. (A and B) Schematic overview of 

(A) interaction simulations using genome-scale models and (B) phyllosphere inoculation and interaction 
mapping procedure. Interactions are inferred by comparing a strain’s growth in combination with another 
strain with that in monoculture. This is computed as the log ratio of biomass fluxes for the genome-scale 
models or as the logs-fold change in CFUs per gram plant fresh weight for the in planta experiments. 

(C and E) Predicted interaction outcome for the strain indicated on the y axis in combination with the 
strain or SynCom indicated on the x axis, based on niche overlap index (NOI, top left) and genome 

scale models (GSM, bottom right). The fill color indicates high (>0.75, blue shades) or low (<0.75, gray 
shades) degree of niche overlap and predicted positive (red shades) or negative (blue shades) interactions 
based on genome-scale metabolic models. (D and F) Interaction outcomes observed in planta. Heatmap 
showing the logs-fold changes (logaFCs) (pairwise or SynCom inoculation versus monoassociation) for the 
strain on the y axis in combination with the strain or SynCom indicated on the x axis, based on absolute 
abundances obtained by CFU enumeration (n = 11 to 12). The color of the boxes reflects the observed logaFC, 
and the black frames around the boxes indicate a significant difference compared with the monoassociation 
condition (two-sided Wilcoxon rank sum test, Holm-adjusted P < 0.05). |LogsFC| < 1 are overlaid with a 
crosshatched pattern. Confusion matrices comparing modeling predictions and in planta outcomes are 
provided in table S4, and colony counts for the in planta experiments are provided in table S5. 
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cutoff, as seen in the positive interaction out- 
come for Sphingomonas sp. Leaf34 paired with 
Microbacterium sp. Leafi79, of Arthrobacter sp. 
Leafl45 paired with Rhizobium sp. Leaf202, 
and of Rhizobium sp. Leaf202 in the SynCom7 
condition. These weakly positive effects occurred 
despite high predicted NOI, suggesting the in- 
fluence of additional metabolic mechanisms 
not encompassed by competition. 

We sought to further validate our predic- 
tions with a second experiment spanning 
more extreme degrees of niche overlap. We 
generated pairings including the two highly 
versatile strains Rhizobium sp. Leaf202 and 
Arthrobacter sp. Leafl45 (V = 68% and 77%, 
respectively) as well as the low-versatility 
strains Frigoribacterium sp. Leaf8, Curtobac- 
terium sp. Leaf154, Rathayibacter sp. Leaf164, 
and Frondihabitans sp. Leaf304 (V = 27% for 
all four strains). All pairings resulted in a high 


degree of niche overlap for the low-versatility _ 


strains, whereas the NOI for either Leaf202 
or Leafl45 with any other strain was low. Cor- 
respondingly, the genome-scale models pre- 
dicted almost exclusively negative interaction 
outcomes for all low-versatility strains, with 
weakly positive outcomes for Leaf202 in com- 
bination with all strains except Leafl45, as 
well as stronger positive outcomes for Leafl45 
in combination with most other strains (Fig. 
4E). We additionally predicted interaction out- 
comes for each of the three low-versatility 
strains paired with a combination of Leaf202 
and Leaf145 (referred to as SynCom3), which 
resulted in strongly negative interaction out- 
comes for each strain. These predictions were 
tested with in planta experiments for these 
pairings: As predicted, all significant instances 
of logo-fold changes greater than 1 were nega- 
tive, including those for each low-versatility 
strain in the SynCom3 condition (Fig. 4F). 
Additionally, Leaf202 and Leafl45 each exper- 
ienced two instances of weakly positive out- 
comes when paired with other low-versatility 
strains, which were captured by our genome- 
scale modeling predictions. Our experiments 
confirmed the validity of the computational 
predictions (tables S4 and S5) and highlighted 
the strong contribution of resource competition 
to strain-specific interaction outcomes in situ. 

To test the specificity of our predictions to a 
leaf environment, we carried out a series of 
in vitro cultivations in which we cultured the 
same strain pairs and communities in shake 
flasks (Materials and methods). These culti- 
vations resulted in exclusively negative inter- 
action outcomes for all strains in the SynCom7 
and SynCom3 conditions (fig. S7), similarly to 
our in planta experiments. These outcomes, 
which are consistent with an additional set of 
community simulations that used randomly 
selected strains (fig. S8), highlight the increased 
degrees of competition that organisms may be 
subject to in multispecies settings. However, 
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Ratio of total sugar uptake flux 


Fig. 5. Model-predicted interaction outcomes and mechanisms. (A and 

B) Log.FCs of (A) biomass and (B) resource uptake fluxes for two representative 
interactions validated in planta. Dots indicate absolute logFCs of less than 
0.05 mmol gDW™ hour7?. (C) Predicted pairwise interaction outcomes between 
all 188 nonmethylotrophic strains in the At-LSPHERE (n = 35,156 outcomes 

for 17,578 pairs). Hierarchical clustering was performed on interaction outcomes, 
with strain-specific phylogeny highlighted. White cells denote instances of no 
Distribution of pairwise 
interaction outcomes (n = 35,156). Dashed lines separate outcomes in which a 
strain’s predicted biomass flux in coculture was either less than 90% of that 

in monoculture (strongly negative), within 10% of that in monoculture (neutral), 
or more than 110% of that in monoculture (strongly positive). (E) Classification 


predicted growth in both mono- and coculture. (D) 


our in vitro experiments also resulted in sub- 
stantially fewer positive outcomes of pairwise 
interactions when compared with our in planta 
results, suggesting competitive pressures in- 
herent to batch cultures that are not captured 
by our use of NOI and genome-scale models. 
In particular, the lack of spatial structure may 
favor strains that experience fast growth when 
substrate availability is high and thus out- 
compete slower-growing strains within the 
timescale of a batch experiment. Moreover, 
this rapid depletion of nutrients contrasts with 
the resource dynamics of leaf surfaces, which 
exhibit a steady resupply of resources that can 
be accessed by epiphytic microbes (38). Al- 
though we do not explicitly consider spatial 
structure in our use of NOI and genome-scale 
models, these tools generate predictions based 
on a broader consideration of the various re- 
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Ratio of total amino acid uptake flux 


sources that can be used by the organisms at 
steady state. This assumption may therefore 
better reflect a broader and more continuously 
supplied pool of resources, which can be used 
by microbes on a population level in spatially 
structured settings. 


Compensatory metabolic mechanisms offset 
resource competition 


In addition to predicting interaction outcome 
directionalities, we used genome-scale model- 
ing to explore the metabolic mechanisms that 
could be underlying the observed ecological 
patterns. We first examined experimentally 
validated interactions to determine changes 
in resource uptake rates that emerged as a 
result of pairing two strains together. As rep- 
resentative examples, we looked specifically 
at two interactions involving A7throbacter sp. 


Ratio of total organic acid uptake flux 


of pairwise ecological outcomes (n = 17,578). (F) Distributions of flux ratios 
between resource uptake in coculture and monoculture, according to corresponding 
interaction outcome. Only simulations in which a strain achieved growth in both 
monoculture and coculture are considered (n = 28,316 outcomes). Differences 
between uptake rates of resource types provided in the simulated medium are 
highlighted for sugars (left), amino acids (center), and organic acids (right). 
Distributions of uptake fluxes are statistically significant for all three resource types 
(p<«<1x 10°”) as determined by one-tailed Mann-Whitney U tests. For clarity, 
horizontal axes are truncated and show 98.8% of outcomes for sugars, 98.7% 

for amino acids, and 94.7% for organic acids. The dashed line at ratio of l separates . 
instances of lower or higher uptake flux between coculture and monoculture, with 
percentages highlighting the number of instances less than or greater than 1. 


Leafl45, a highly versatile strain that experi- 
enced a weakly positive effect when paired 
with Frigoribacterium sp. Leaf8 and a negative 
effect when paired with Rhodococcus sp. Leaf233 
(Fig. 4, C to F, and Fig. 5A). Our flux balance 
simulations had predicted that in both cocul- 
tures, Leafl45 would have a lower net uptake 
flux of sugars compared with those experi- 
enced in monoculture (Fig. 5B). This was also 
the case for Leaf8, which was additionally pre- 
dicted to experience a reduction in amino acid 
uptake flux when paired with Leafl45. Whereas 
a similar reduction in amino and organic acid 
uptake occurred for Leaf233, Leafl45 was able 
to take up amino acids at rates similar to those 
noted in monoculture. The contribution of this 
reallocation of resources to the dominance 
of Leafl45 can be seen in its interaction with 
Leaf8, in which Leafl45 was able to shift its 
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metabolism to take up a slightly greater quan- 
tity of amino acids, as occurred in monoculture. 
The increased availability of these resources, 
in part a product of the low metabolic activity 
of Leaf8, suggests metabolic cross-feeding as 
a contributing factor in the positive effect ex- 
perienced by Leaf145 in coculture. 

The emergence of distinct resource-allocation 
patterns between these strain pairs prompted 
us to ask how widespread they could be across 
a wider ecosystem. We thus used the genome- 
scale models to generate predictions of pairwise 
interaction outcomes for all nonmethylotrophic 
strains in the At-LSPHERE, carrying out a total 
of 17,578 pairwise simulations (Fig. 5C). These 
simulations revealed the high prevalence of 
competitive effects, with 63.2% of outcomes 
predicted to be negative (Fig. 5D). Of these, 
the large majority (representing 58.2% of all 
outcomes) were predicted to be strongly nega- 
tive, which we defined as a strain’s biomass 
production rate in coculture being less than 
90% of that in monoculture. Conversely, 25.5% 
of all outcomes were predicted to be strongly 
positive (biomass production in coculture more 
than 110% of that in monoculture), further un- 
derscoring the strength of competitive pres- 
sures within leaf bacterial communities. We 
combined these strain-specific effects to de- 
fine interaction outcomes between pairs, which 
revealed that in 94% of interactions, at least 
one participant experienced a negative out- 
come (Fig. 5E). As such, cooperative inter- 
actions were predicted to be relatively rare, 
with 3% of all pairwise interactions resulting 
in commensalism and only 1% resulting in 
mutualism. 

An analysis of metabolic fluxes across our 
simulations revealed key differences in rates of 
resource uptake between positive and nega- 
tive interaction outcomes (Fig. 5F). As expected, 
the models predicted significantly lower re- 
source uptake rates for organisms that de- 
creased in abundance relative to monoculture 
than for organisms that increased in abun- 
dance. However, whereas nearly all flux ratios 
associated with negative outcomes were below 
1 (occurring for sugars, amino acids, and or- 
ganic acids in 88.6, 90.7, and 78.4% of nega- 
tive outcomes, respectively), many positive 
interaction outcomes displayed increases in 
amino acid and organic acid uptake flux (in 
70.1 and 49.9% of positive outcomes, respec- 
tively). These patterns thus suggest a stron- 
ger degree of competition for sugars among 
phyllosphere bacteria, which is more likely 
to be offset by increased uptake of amino and 
organic acids. 

These predicted compensatory mechanisms 
suggested that strains with higher substrate 
versatilities would be more likely to increase 
in abundance when in the presence of another 
strain. Indeed, strains with very high versatil- 
ities (e.g., Acidovorax spp. and Pseudomonas 
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spp.) were predicted to have higher than aver- 
age rates of experiencing positive outcomes 
(fig. S9). However, strain-specific substrate 
versatility itself was a poor predictor of posi- 
tive outcome frequency (R? = 0.10) because 
strains with very low versatilities (e.g., Chrys- 
eobacterium spp., which did not grow on any of 
the supplied carbon sources individually, or 
Bosea sp. Leaf344) were also predicted to ex- 
perience strongly positive outcomes in rare 
cases. These effects suggest a role of cross-fed 
amino or organic acids as valuable sources of 
metabolic complementation (65), which in ad- 
dition to enabling highly versatile strains to 
outcompete a partner strain, can also facil- 
itate the survival of specialized organisms. 


Discussion 


Understanding the factors that contribute to 
the assembly of microbiomes remains a chal- 
lenge for the study of natural ecosystems. In 
this study, we have shown how predictions 
based on the resource-utilization capabilities 
of individual bacteria enable recapitulation of 
interaction outcomes observed in the phyllo- 
sphere. These predictions underscored the 
prevalence of competitive interactions between 
leaf strains. Additionally, the high accuracy 
(Fig. 4) of these predictions suggested that 
competition for resources is responsible for 
many outcomes observed in situ, as also re- 
cently observed in community contexts (9). 
In particular, our experiments showed how 
high degrees of niche overlap were reliably 
predictive of negative interaction outcomes 
in planta. This relationship demonstrates the 
role that highly versatile organisms play in 
communities, as well as the implications for 
the design of microbial consortia based on co- 
operative interactions, where pairings of highly 
competitive organisms can be avoided with 
greater certainty. Our work highlights how a 
resource-by-resource understanding of an orga- 
nism’s catabolic potential in monoculture is 
sufficient for making high-level predictions of 
competition in an ecologically relevant setting. 

In addition to mapping the metabolic ca- 
pabilities of a large strain collection at high 
resolution, our in vitro carbon source screen 
informed the curation of genome-scale meta- 
bolic models for each of the 224 strains. Our 
experimentally curated collection of genome- 
scale models allowed us to go beyond the po- 
tential of niche overlap by predicting positive 
interaction outcomes and suggesting their 
molecular mechanisms. Our curation process 
provided a quantitative basis for the need to 
incorporate experimental data into model 
generation, given the low accuracy of the ini- 
tial draft models, which were based on ge- 
nome annotations alone. Although improved 
annotation resources, as well as organism- 
specific biomass compositions, may serve to 
reduce this source of uncertainty (60, 61), we 


expect experimental curation to remain es- 
sential for selecting the optimal combination 
of gap-filled reactions to recapitulate an orga- 
nism’s exact resource-utilization profile. 

Our metabolic modeling predicted interac- 
tion outcomes on the basis of carbon source- 
utilization capabilities. Further curation that 
integrates additional properties such as vita- 
min auxotrophies and storage (66), as well as 
metabolic shifts occurring from gene regula- 
tion (67, 68), is likely to improve the quantitative 
predictive power of this framework (33, 69). 
Moreover, although our predictions based 
on metabolic mechanisms are informative of 
key aspects underlying the assembly of plant- 
associated microbiomes, they do not consider 
additional factors that are known to shape 
leaf communities, such as signaling mole- 
cules, antagonistic interactions, or host immu- 
nity (5, 70-73). A potential consequence of this 
limitation may be seen in the example of | 
Pseudomonas sp. Leaf15, which experienced 
a strong negative interaction in planta when 
paired with Rhizobium sp. Leaf202 (Fig. 4, C 
and D). Even though this outcome was qual- 
itatively predicted by our simulations, the pres- 
ence of a type VI secretion system in Leaf202, 
previously described to be active against a 
Pseudomonas strain (42), may also be produc- 
ing an additional inhibitory effect not captured 
in our modeling framework. Furthermore, al- 
though our experimental approach that quan- 
tified community composition on the basis 
of bulk sampling aligns with our use of flux 
balance analysis, it abstracts away the influ- 
ence of spatial structure on interaction out- 
comes. Leaf surfaces are known to exhibit 
considerable nutrient heterogeneity, which, 
together with the aggregation of microbes 
around stomata or in microdroplets, is likely 
to affect interaction directions at the micro- 
scale (36, 74, 75). Capturing varying degrees 
of uncertainty in our modeling predictions 
can partially represent this heterogeneity 
(fig. S10), but an improved understanding of 
the nutrient compositions at specific locations 
on the leaf may enable future parametrization 
of existing metabolic modeling tools that can 
explicitly consider spatial structure (76-79). 

Our present results underscore the strengths 
of an integrated approach for generating eco- 
logical predictions in a mechanistic yet scal- 
able way, while establishing a computational 
resource for further exploration of the molec- 
ular mechanisms that underlie the assembly 
of complex microbiomes. 


Materials and methods 
Cultivation of bacteria 


At-LSPHERE strains were grown on R-2A agar 
(Sigma-Aldrich) supplemented with 0.5% (v/v) 
methanol (R-2A+M) at 22°C. Strains were rou- 
tinely streaked out from a cryo stock stored 
at -80°C, grown for 4 days, streaked on fresh 
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R-2A+M plates, and grown for 3 more days 
before the start of experiments. If necessary, 
streptomycin (20 yg ml‘) was added to the 
medium for selection. 


Carbon source screen 


The minimal medium agar plates were pre- 
pared with a common minimal medium com- 
position (80). One liter of the final medium 
contained 2.4 g K,HPO,, 1.08 g NaH2PO, - 2HO, 
1.62 g NH,Cl and 0.2 g MgSO, : '7H,O, and 15 g 
noble agar (Becton, Dickinson and Company). 
The medium was supplemented with the fol- 
lowing trace elements: 15 mg NagEDTA - 2H.O, 
3 mg FeSO, - 7H.O, 4.5 mg ZnSO, - 7H,O, 3 mg 
CoCl, - 6H,0, 0.64 mg MnCl, 1 mg H3BOs, 
0.4 mg Na,MoO, - 2H,O, 0.3 mg CuSO, - 5H,O, 
and 3 mg CaCl, - 2H,O, and vitamins: 500 pg 
D-pantothenic acid hemi calcium salt, 100 pg 
biotin, 400 ug riboflavin, 400 ug thiamine 
HCl, 200 ug pyridoxal HCl, 150 ug p-amino 
benzoic acid, 200 ug cobalamin, 50 ug lipoic 
acid, 150 ug nicotinic acid, and 100 ug folic 
acid. All media components were prepared 
with Milli-Q quality water (Millipore). Each 
carbon source was added as a 10x stock 
solution to the premixed medium, except for 
coniferyl alcohol and tyrosine, which were 
added to the medium directly as powder. The 
concentration of each carbon source was nor- 
malized to 30 mM carbon (e.g., 5 mM glucose 
and 30 mM methanol). A list of all 45 carbon 
sources and final concentrations can be found 
in table S1. 

At-LSPHERE isolates grown on R-2A+M 
medium were suspended (1 ul loopful) in 1 ml 
10 mM magnesium chloride solution, which 
corresponds to an optical density at 600 nm 
(ODgo0) of approximately 0.1 to 0.3. The cell 
suspensions were transferred to 96-well plates 
before spotting on agar plates, leaving every 
other well empty in a checkerboard manner to 
reduce the risk of contamination and allow 
more space on the agar plate for each strain. 
Increasing the distance between spots also 
strongly reduces the risk of strain interactions 
based on secondary metabolite production 
(e.g., antibiotics) that could lead to false-negative 
results (42). Cell suspensions (2 ul) were spotted 
on top of each minimal medium plate con- 
taining individual carbon sources, as well as 
on a minimal medium plate without carbon 
source and a R-2A+M plate. Spotting was 
carried out using a Rainin Liquidator Man- 
ual Pipetting System (Mettler Toledo). Plates 
were dried under laminar flow. Bacteria were 
incubated at 22°C and photographs were taken 
at 5, 7, and 10 days after spotting. In sum, the 
screen comprised three total rounds includ- 
ing 96 strains each, and the last round also 
comprised 56 randomly selected strains that 
were screened a second time for validation 
purposes (table $7). Plate images are availa- 
ble through Zenodo (87). 
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Analysis of bacterial growth in carbon 

source screen 

Colony growth was scored for all strains after 
7 days (0 = no growth, 1 = growth) by compar- 
ing growth on each carbon source with the 
control plate without a carbon source. We used 
a dual approach incorporating automated 
growth scoring based on pixel intensities, 
using a common threshold for all strains and 
manual scoring through visual inspection. 

For the automated analysis, we developed 
the software tool platescan (github.com/ 
MicrobiologyETHZ/platescan), which enables 
unbiased scoring of all strains over different 
conditions with a firm growth cutoff. Briefly, 
platescan uses cross-correlation techniques to 
crop the plate image, locate the colony grid 
layout, and then determine the best-fitting 
colony size and location on the basis of the 
assumption that they are approximately cir- 
cular. The pixel intensities are rescaled between 
1 and 99% of the intensity distribution to en- 
sure that the images have approximately the 
same contrast. The program then reports fore- 
ground and background pixel intensities in red, 
green, blue, and grayscale, which are then used 
to threshold growth versus no growth. An 
example can be found on the program’s GitHub 
repository. 

We used platescan to assign growth values to 
each strain with the following parameters: -r:20, 
-p: 10,-min_r: 15,-max_r: 40,-edge: 200 40 200 
40. In addition, two parameters were set indi- 
vidually for each screening round because of 
variation in picture zoom level. Screen1: -x 105 
-y 108, screen2: -x 110 -y 109, screen3: -x 114 -y 113. 
We used the pixel intensities reported in the 
red channel with a defined minimum thresh- 
old to consider a strain as having grown. Pixel 
intensities were obtained by subtracting back- 
ground values from foreground values for 
each colony and subsequently subtracting the 
no-carbon control value. For the few cases in 
which strains spread into the area used for 
background intensity calculation, the back- 
ground value of the neighboring colony was 
subtracted. 

For a fraction of strains, we also observed 
substantial growth without the addition of a 
carbon source to the medium, suggesting that 
these strains could either grow on a compo- 
nent or impurity within the agar or were able 
to grow on volatile compounds present in the 
surrounding environment. Therefore, we also 
manually scored growth for all strains rela- 
tive to the control plate containing no carbon. 
Because the magnitude of colony formation 
differed between carbon sources, assigning a 
binary growth (1) or no growth (0) value was 
challenging for strains that exhibited residual 
growth on the no-carbon control or were gen- 
erally slow growing. We therefore introduced 
a third category of nonsignificant (NS) growth 
for the initial screen analysis but considered 


them as no growth for subsequent analyses to 
keep the number of false positives low (fig. SIIA). 
The results of the computational analysis with 
platescan and the manual analysis showed a 
very high overlap between the two methods. The 
highest overlap (with 94.6% of matches com- 
pared with the manually scored data) was ob- 
tained with a cutoff of 20 arbitrary units (a.u.). 
The comparison of the manually and com- 
putationally scored data showed that the 
computational method scored growth more 
often than the manual analysis (n = 485), but 
most of these were scored as NS initially (n = 
332) (fig. S12). To achieve the best accuracy 
possible, we applied a rigorous curation pro- 
cess to the screening data. First, we reinspected 
all cases that did not agree between the manual 
and computational analysis (20 a.u. cutoff) and 
all cases that were scored as NS manually. This 
revealed few false-positive and false-negative 
scores for both the manual and computational | 
analysis. In the case of the computational 
analysis, these mistakes were due to either a 
slight misalignment of the grid or reflections 
on the plate. To account for slow-growing strains 
and strains that exhibited high background 
growth on the no-carbon control condition, we 
also reinspected all instances that were close to 
the chosen pixel cutoff (10 to 40 a.u.). For these, 
we also inspected the growth after 10 days and 
scored growth on the basis of whether there was 
an increase in colony density over time. This 
latter step reduced the number of NS values 
observed in the initial analysis and mainly 
improved scoring of the previously mentioned 
slow-growing Actinobacteria (fig. SIIB). The 
curation procedure reduced the false-positive 
rate compared with the original analysis from 
3.3 to 2.3%, whereas the false-negative rate 
increased from 5.8 to 6.5%, with most uncer- 
tainty remaining for Methylobacterium spp. 
and some members of the Actinobacteria. 


Plant cultivation 


Plants were cultivated as described previously 
(19). In brief, A. thaliana Columbia (Col-0) seeds 
were surface sterilized (82) and stratified for 
4 days at 4°C. Arabidopsis were cultivated in six- 
well tissue culture plates (TechnoPlasticProducts) 
filled with 5 ml washed and heat-sterilized 
calcined clay mixed with 2.5 ml half-strength 
Murashige & Skoog medium pH 5.8 includ- 
ing vitamins (% MS, Duchefa). Seeds were 
placed in the center of each well. If a seedling 
did not germinate, a different seedling was 
transplanted from a separate plate after 7 days. 
Starting at day 7, each well was supplemented 
twice per week with 200 ul % MS, respectively. 
Plates were incubated in a growth chamber 
(Percival, CU41-L4) set to 22°C and 54% relative 
humidity with an 11-hour photoperiod, fitted 
with full-spectrum lights (Philips Master TL-D 
18 W/950 Graphica) and lights emitting a 
higher fraction of UVA and UVB (Sylvania 
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Reptistar FI8W/6500 K). The combined light 
intensity was set to 200 to 210 pmol m™ s™ for 
wavelength 400 to 700 nm and 4 to 5 umol m~ 
s ' for wavelength 280 to 400 nm. 


Phyllosphere inoculation 


Bacteria grown on R-2A+M agar plates were 
suspended in 10 mM MgCl, solution, and the 
ODgo0 was adjusted to 0.2 for each strain. The 
final inoculation suspension had an ODgoo9 of 
0.02 for all treatments. For single-strain inocu- 
lations, 150 ul of OD-adjusted strain suspension 
was added to 1.35 ml 10 mM MgCl, solution. 
For two- and three-strain combinations, 75 or 
50 ul of each strain were added, respectively. 
For the seven-strain community, 100 ul of each 
OD-adjusted strain suspension was mixed, 
and then a 10-fold dilution was prepared for 
the final inoculum. The final suspension was 
mixed well, and then 10-day-old Arabidopsis 
seedlings were inoculated by slowly pipetting 
50 ul over the whole seedling. A 10-fold dilu- 
tion series was prepared of each inoculum and 
spotted on R-2A+M agar plates to enumerate 
total bacterial load. For strain mixes, appro- 
priate dilutions were plated on R-2A+M agar 
plates to verify presence of all strains on the 
basis of colony morphology. 


Phyllosphere harvest and bacterial colony 
forming unit (CFU) enumeration 


Bacterial colonization in the phyllosphere was 
enumerated when plants were 28 or 29 days 
old. The whole phyllosphere was harvested 
with sterile tweezers and a scalpel and was 
placed in a 2-ml tube containing 200 pu] 100 mM 
phosphate buffer pH 7 and a sterile 5-mm metal 
ball. The weight of the tube was recorded with 
and without the plant for plant fresh-weight 
calculation. The harvested phyllosphere was 
subsequently crushed for 45 s at 30 Hz with a 
TissueLyser II (Qiagen). Phosphate buffer (600 11) 
was added to the crushed plant material and 
was mixed thoroughly; 100 1 of this suspension 
were transferred to a 96-well plate to prepare a 
10-fold dilution series in 100 mM phosphate 
buffer. The dilution series was spotted on R-2A+M 
agar plates. In addition, 50 ul of each 10°°- and 
10-*-fold dilutions were plated on 9-cm round 
R-2A+M agar plates. If selective plates contain- 
ing streptomycin were used (for Sphingomonas 
selection), 50 1] of dilutions 10‘ and 10~? were 
plated in addition. Plates were incubated at 
room temperature and CFUs were counted 
after 1 to 3 days on dilution series and after 4 to 
7 days on round plates. If a strain was not found 
on the lowest-available dilution, its value was 
set to 0.9 CFUs for this sample for further anal- 
ysis. Rhodococcus sp. Leaf233 was selectively 
grown on minimal medium supplemented 
with maltose (5 mM) when combined with 
Pseudomonas sp. Leaf15. Colonies of Leaf233 
in mixtures with Microbacterium sp. Leaf179 
were differentiated by re-streaking colonies 
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on minimal medium containing isoleucine. 
Mock treated plants (7 = 12) were included in 
each plant experiment to detect any system- 
atic contamination with external bacteria. 


Shake flask cultivations 


At-LSPHERE isolates grown on R-2A+M me- 
dium were suspended in 4 ml 10 mM magne- 
sium chloride solution at an approximate OD¢o 
of 3. Each strain was inoculated at an OD¢oo of 
0.025 into 100-ml baffled Erlenmeyer flasks 
containing 10 ml of a liquid minimal medium 
with the same base composition (ions and 
vitamins) as in the carbon source screen. This 
medium contained all 44: growth yielding car- 
bon sources used in the in vitro screen (table 
S1) at a total concentration of 10 mM C, with 
each carbon source at a relative concentration 
corresponding to the medium composition 
used in the modeling. Strains were inoculated 
in four biological replicates in monocultures 
and in the pairwise and community combina- 
tions used in the plant experiments. Cultures 
were grown at 22°C with shaking at 200 rpm 
for 60 hours to allow strains to reach stationary 
phase. Cell numbers in each culture were enu- 
merated through a 10-fold dilution series 
spotted on R-2A+M agar plates. 


Data analysis 


Carbon source-utilization data, plant colo- 
nization experiments, and shake flask experi- 
ments were analyzed with R 4.04 in RStudio. 
For the carbon source screen, false-positive 
and false-negative rates were calculated on the 
basis of 56 isolates that were screened twice 
(table S7). The strain phylogeny was based on 
full-length 16S ribosomal RNA gene sequences 
extracted from the genome sequence of each 
strain as described previously (18). Strains that 
did not grow on any carbon source were ex- 
cluded from all further analyses. The Manhattan 
distances of all strains based on carbon source 
utilization were calculated with the vegdist 
function of the vegan 2.5 package (83). Hier- 
archical clustering was conducted with the 
Aclust fanction with Ward method (ward.D2). 
Principal coordinate analysis was performed 
with the emdscale function in MATLAB R2021a. 
Bacterial colonization data was logy trans- 
formed to calculate the median colonization 
and statistical significance on the basis of the 
Wilcoxon rank sum test. P-values were corrected 
for multiple testing with the Holm method. 
Effect sizes for the shake flask experiments 
were calculated with the cohens_d function 
(var.equal = FALSE, hedges.correction = TRUE) 
in the rstatixv package (84). Logs-fold changes 
(combination/monoassociation) were calculated 
on the basis of the median absolute abundance. 

Data was visualized in R with the package 
ggplot2 (85) and was further annotated with 
Adobe Illustrator. For data analysis and visu- 
alization, the following R packages were used: 


tidyverse (86), ape 5.4-1 (87), ggridges 0.5.3 (88), 
shades1.4 (89), phylloR (18), and dendextend 
1.14 (90). 


Generation and curation of genome-scale models 


The tool CarveMe (58) was first used to generate 
draft metabolic models based on the assembled 
genomes of each strain in the At-LSPHERE 
collection [(53), BioProjects PRJNA297956 and 
PRJEB47672]. The quality of these genomes was 
assessed using the tool CheckM (97) (table S8), 
which reported a mean completeness of 0.993 + 
0.007 and a mean contamination of 0.005 + 
0.007 (mean + SD, [0,1]). We compared the pre- 
dicted growth capabilities of each draft model 
against data from our in vitro carbon source 
screen and performed the following steps on each 
draft model: First, we used the tool NICEgame 
[(62), github.com/EPFL-LCSB/NICEgame] to 
generate sets of new biosynthetic and transport 
reactions [drawn from the BiGG database (92)] 


that would allow the model to produce biomass ° 


from each growth-supporting carbon source 
identified in our in vitro screen. We used this 
process, which relies on known reaction thermo- 
dynamic constraints (93, 94), to produce at most 
10 alternative sets of reactions for each growth- 
supporting carbon source. We separately integ- 
rated each set of reactions into the draft model 
and tested whether it could produce biomass 
on a simulated minimal medium (table S6) sup- 
plemented with each of the 45 carbon sources 
used in the in vitro screen. We also tested the 
growth of the model when combinations of re- 
action sets from up to three carbon sources were 
integrated, selecting the most accurate model 
(i.e., the most representative with the fewest 
false positives) as measured by Matthews cor- 
relation coefficient [(7P x TN — FP x FN)/ 
/ (TP + FP)(TP + FN)(TN + FP)(TN + EN)| 
for further curation. Most models reached this 
stage having had sets of reactions from only 
one (111 models) or two (82 models) growth- 
supporting carbon sources integrated, as add- 
ing reactions that enable growth on one 
resource can resolve additional gaps that 
enable growth on other resources. The carbon 
sources most often used for gap-filling were 
maltose, succinate, gluconate, and xylose (being 
used to gap-fill 40, 26, 23, and 21 models, re- 
spectively). To correct for remaining false nega- 
tives (i.e., the model did not grow on a carbon 
source that supported growth in vitro), we 
further added reactions from a different model 
in our collection that did recapitulate growth 
on the relevant carbon source. A maximum 
false-positive rate of either 10 carbon sources or 
half of all true positives for the model, which- 
ever was smaller, was set to avoid integrating 
an excessive number of new reactions. False 
positives for a given carbon source were cor- 
rected by removing internal reactions rele- 
vant to its metabolism or by restricting the 
relevant transport reaction when this was not 
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successful. Model- and carbon source-specific 
accuracies are summarized in fig. $5, and re- 
actions added to the models as part of the 
curation are enumerated in fig. S4 and table S3. 
We identified no relationship between model 
accuracy and either the size (fig. S6B) or the 
completeness of the underlying genome (fig. 
$13). A final quality control was performed on 
each model (59), consisting of testing for mass 
and charge balance; performing a leak test to 
ensure no metabolites could be produced from 
nothing; standardizing the metabolite name- 
space; adding reaction subsystems; and adding 
additional gene, metabolite, and reaction iden- 
tifiers when available. A final report was gen- 
erated for each model with the validation tool 
MEMOTE [total score = 0.84 + 0.02 (mean + 
SD, 2 = 224)] (95), and balanced accuracies for 
individual models and pairwise interaction pre- 
dictions were calculated as (TPR + TNR)/2, 
where 7PR and TNR represent the true-positive 
rate and false-positive rates, respectively. All 
scripts for model generation and simulation, as 
well as the models and quality reports, are avail- 
able at github.com/VorholtLab/i-At-LSPHERE. 


Computing mono- and coculture growth 


All growth simulations were performed using 
the COBRA Toolbox v2.24.3 (96) with the 
CPLEX solver v12.10 (IBM) in MATLAB R2021a 
(MathWorks). Nonlimiting amounts (0max = 
1000 mmol gDW* hour”) of a minimal me- 
dium composition containing ions, water, and 
sources of nitrogen, sulfur, and phosphorus 
were provided to the models, along with vita- 
mins at Umax = 0.15 mmol gDW* hour” (table 
S6). Limiting amounts of the 45 carbon sources 
tested in our in vitro screen were provided at 
abundances intended to broadly estimate the 
relative availabilities of resource types on leaf 
surfaces (36, 38, 39, 54) (Omax = 0.15 mmol 
gDW “hour” for sugars and organic acids, 
0.075 mmol gDW”™ hour” for amino acids, 
and 1.5 mmol gDW”™ hour” for methanol). 
Model growth was simulated with biomass 
as the objective function and a minimal ATP 
maintenance flux of 0.5 mmol gDW hour”. 
Optimizations were also set to minimize all 
reaction fluxes to more closely simulate efficient 
proteome utilization and minimize metabolite 
cycling (97). For each pair or community, the 
growth of each strain was first simulated in- 
dividually, and the resulting biomass flux val- 
ues and resource uptake fluxes were recorded. 
Models were then merged by integrating them 
into a common extracellular compartment 
(98, 99). Coupling constraints were introduced 
to avoid infeasible solutions in which one or- 
ganism produced metabolic flux for the other 
without producing biomass itself (34, 100). 
Additionally, metabolite uptake directional- 
ities were fixed to those observed in monocul- 
ture to minimize inconsistencies in resource 
preferences between mono- and coculture. 
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The sum of biomass reactions was set as the 
objective to be optimized, and exchange reac- 
tion Uma, Values were set as equal to those of 
the models in monoculture, thus simulating 
an equal abundance of resources between the 
mono- and coculture conditions. The resulting 
biomass flux values were recorded and used to 
calculate interaction scores, which were de- 
fined as the log, ratio of biomass flux in co- 
culture to that in monoculture. Interaction 
scores were limited to between -5 and 5, with 
extreme log,-fold changes falling outside this 
range being set to -5 or 5. 
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INTRODUCTION: Discovered in the mitochondrion 
of Trypanosoma brucei, a protozoan parasite 
that causes African sleeping sickness, RNA edit- 
ing denotes a spectrum of phylogenetically 
widespread and often mechanistically unre- 
lated molecular processes that change RNA se- 
quence. In trypanosomal RNA editing, gRNAs 
direct massive recoding of cryptic mitochondrial 
transcripts to generate mRNAs. Assembled into 
the partially defined editosome, two principal 
ribonucleoprotein particles execute an unconven- 
tional method of genetic information transfer 
from gRNA to mRNA. The RNA-editing substrate- 
binding complex (RESC) stabilizes gRNAs and 
engages mRNAs. The RNA-editing catalytic com- 


plex (RECC) fulfills gRNA-programmed mRNA 
cleavage, uridine insertion or deletion, and rel- 
igation reactions. 


RATIONALE: Mitochondrial gRNA forms an im- 
perfect duplex with mRNA precursor in which 
secondary structure defines multiple editing sites. 
To reveal gRNA stabilization and mRNA recog- 
nition mechanisms, we have determined atomic 
structures of three states of the RESC using cry- 
ogenic electron microscopy and characterized 
individual subunits’ RNA-binding specificity. 


RESULTS: Biochemical studies defined the RESC 
as a heterogenous assembly of ~18 proteins that 
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Structures of RESCs. (Left) RESC-A sequesters gRNA termini, promoting hairpin formation and blocking 
mRNA access. (Right) RESC-A conversion into RESC-B unfolds gRNA and allows mRNA recognition, likely 


exposing editing sites to RECC-embedded enzymes. 
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aL 
enclose gRNAs and mRNAs. Prior work aot 
demonstrated that the RESC1/2 dimer stabi... — 
gRNAs and most other proteins bind mRNAs. 
The structure of the six-member RESC-A shows 
how RESC2 RNA triphosphatase pseudoenzyme 
engulfs the triphosphorylated gRNA’s 5’ end, 
and how the RESC5/6 dimer fastens the 3’ end. 
These contacts promote gRNA folding into a 
“hairpin-like” conformation and shield both 
termini from nucleases. The 10-polypeptide 
RESC-B structure suggests that a remodeling 
event recovers gRNA from the RESC-A “storage” 
mode and transitions the single-stranded mol- 
ecule into mRNA proximity. In the process, the 
gRNA’s 5’ end is ejected from the RESC2 tri- 
phosphate binding tunnel but the 3’ end re- 
mains wedged between RESC5 and RESC6, 
which are the only proteins shared between 
RESC-A and RESC-B. All RESC-B subunits, in- 
cluding RESC5/6, contact mRNA along a ~20- 
nucleotide segment. However, gRNA andmRNA | 
do not interact within RESC-B boundaries. 

A typical gRNA starts with an “anchor” fully 
complementary to the mRNA target; the adja- 
cent “guiding” part pairs with mRNA sparsely 
and creates editing sites typified by single- 
stranded bulges and loops. A few nucleotides 
separate this “information-rich” sequence from ‘ 
the terminal uridine tail. Mechanistically, se- 
questering gRNA’s “information-poor” 3’ end 
inside RESC-B allows guiding and anchor parts 
to hybridize with mRNA beyond RESC-P’s sur- ‘ 
face. Apparently, RESC-B proteins recruit gRNAs 
and mRNAs irrespective of their sequences and 
position the two strands in a roughly antipar- 
allel orientation, but distant enough to prevent 
spurious annealing within the complex. The ex- 
posed gRNA and mRNA regions likely sample 
each other until productive hybridization cre- 
ates a substrate for the catalytic RECC complex. 


CONCLUSION: The architectures of distinct + 
RESC states reveal a diversity of protein folds ‘ 
that have been co-opted into the RNA editing 
machinery. We have discovered how common 
gRNA elements function in stabilizing this . 
short molecule and engaging mRNA. Our results 
show that gRNA-mRNA recognition emanates 
from ribonucleoprotein complex remodeling 
rather than initiating base pairing of the 
anchor sequence with the mRNA target. Last, 
structural information on essential functional 
features, such as the RESC2 triphosphate bind- 
ing site, may facilitate development of new 
trypanocides. 
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1,2% 


In Trypanosoma brucei, the editosome, composed of RNA-editing substrate-binding complex (RESC) 

and RNA-editing catalytic complex (RECC), orchestrates guide RNA (gRNA)-programmed editing to recode 
cryptic mitochondrial transcripts into messenger RNAs (mRNAs). The mechanism of information transfer from 
gRNA to mRNA is unclear owing to a lack of high-resolution structures for these complexes. With cryo-electron 
microscopy and functional studies, we have captured gRNA-stabilizing RESC-A and gRNA-mRNA-binding 
RESC-B and RESC-C particles. RESC-A sequesters gRNA termini, thus promoting hairpin formation and 
blocking mRNA access. The conversion of RESC-A into RESC-B or -C unfolds gRNA and allows mRNA selection. 
The ensuing gRNA-mRNA duplex protrudes from RESC-B, likely exposing editing sites to RECC-catalyzed 
cleavage, uridine insertion or deletion, and ligation. Our work reveals a remodeling event facilitating gRNA- 
mRNA hybridization and assembly of a macromolecular substrate for the editosome’s catalytic modality. 


inetoplastids are a group of flagellated 

protozoans that infect humans and live- 

stock in some of the most impoverished 

regions of the world. Trypanosoma brucei 

spp. cause African human and animal 
trypanosomiasis, a substantial health threat 
and economic burden, respectively, in sub- 
Saharan Africa. The toxicity and complex regi- 
mens of existing treatments call for identification 
of potential parasite-specific drug targets, among 
which the distinct organellar RNA-editing com- 
plexes stand prominent. In trypanosomes, RNA 
editing restores the protein coding capacity 
of mitochondrial cryptogenes by massive post- 
transcriptional insertions and deletions of 
uridines (uridine, U) (, 2). Short noncoding 
RNAs bind to and program sequence changes in 
the mRNA. Astutely termed guide RNAs (gRNAs) 
are ~50-nucleotide (nt) molecules that feature 
a triphosphorylated 5’ end, followed by the 
“anchor” region complementary to mRNA tar- 
get, the “guiding” part, and often an uridylated 
3' end (3-5). In massively edited (pan-edited) 
mRNAs, overlapping gRNAs sequentially bind 
to mRNA as recoding progresses from the 3’ to 
the 5’ end (6). Discoveries of RNA editing and 
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gRNAs in trypanosomes have laid the foundation 
for developing modern RNA-directed genome- 
and transcriptome-altering technologies. 

The partially defined T: brucei editosome in- 
cludes RNA-editing substrate-binding com- 
plex (RESC) and RNA-editing catalytic complex 
(RECC) (7-9). Molecular studies have impli- 
cated homologous binding partners RESC1 
and RESC2, previously named GRBC1/2 (10) or 
GAP2/1 (12), in gRNA stabilization and demon- 
strated their presence in heterogeneous (0.3 
to 1.2 MDa) ribonucleoproteins (RNPs) in- 
volving up to 18 polypeptides [RESC protein 
nomenclature is available in (8)] (JO-12). The 
RNPs that contain the RESC1/2 dimer incorpo- 
rate editing substrates (gRNAs and pre-edited 
mRNAs) and products (partially and fully 
edited mRNAs) (7, 12-14). Conversely, the RECC 
complex encloses enzymes performing gRNA- 
programmed mRNA cleavage, U insertions or 
deletions, and religation (15-17). 

Efforts to understand editing mechanisms 
at the structural level remain limited to crys- 
tallographic studies of recombinantly produced 
individual enzymes (18-20) and factors (21-23). 
Previous electron microscopy (EM) attempts 
yielded negative-stain structures of the cata- 
lytic editing complex at ~30-A resolution (24, 25). 
In this study, we combined biochemical, mass 
spectrometry, in vivo RNA profiling, and cryoID 
(26) approaches to capture three states of the 
RESC and reveal their atomic details by single- 
particle cryo-electron microscopy (cryo-EM). 


Results 
Capturing three states of RESC: RESC-A, 
RESC-B, and RESC-C 


To produce isolates amenable to cryo-EM re- 
constitutions at near-atomic resolution, we ini- 


tially purified endogenous RESC2 (Fig. 1A and 
fig. S1) and separated the associated complexes 
by glycerol gradient and native polyacrylamide 
gel electrophoresis (PAGE) to resolve the pre- 
dominant ~280 kDa and less abundant ~0.5 
to 1.2 MDa complexes (Fig. 1B). Although the 
RESC1/2 dimer binds gRNAs in vitro (0, 27), 
distribution along the gradient shows that the 
most abundant ~280 kDa RESC1/2-containing 
particles are devoid of gRNAs, which are se- 
questered into complexes exceeding 400 kDa 
(Fig. 1B, bottom). To capture these larger gRNA- 
containing assemblies, we tested RESC2, RESC5, 
RESC9, and RESC14 isolates for RESC com- 
ponents and associated mRNA-processing com- 
plexes (fig. S2 and table SI). The RESC2-purified 
sample contained abundant RESC1/2 and KREH2 
helicase, whereas both RESC9 and RESC14: iso- 
lates lacked RESC1 to RESC4 and RESCI15 to 
RESC18, respectively. Compared with the others, 
the RESC5 complex comprised a more uniform | 
set of RESC proteins and higher relative amount 
of the catalytic RECC (fig. S2). Size separation 
of RESC5 (Fig. 1C) demonstrated cosedimen- 
tation with RESC4 (Fig. 1D), which is pre- 
dicted to interact with RESC1/2 and RESC5 
(7, 13). Likewise, native PAGE revealed RESC5 
presence in distinct complexes (Fig. 1D), of ‘ 
which RESC1/2 co-occupies the ~400-kDa par- 
ticles and those exceeding 600 kDa (Fig. 1E). 
Informed by these findings, we selected RESC5 
for cryo-EM analysis. To mitigate sample het- 
erogeneity, mitochondrial lysate was treated 
with ribonucleases (RNases) before purification; 
this step reduced copurification of RESC12/12A 
mRNA-binding proteins (28, 29) and mRNA 
editing (RECC and KREH2C) and processing 
(KPAC and PPsome) complexes (8) (Fig. IF and 
table S2). 

The RNase-treated RESC5 sample was evalu- 
ated by means of negative-stain EM and cryo- 
EM. The 2D averages of the cryo-EM particles 
revealed multiple species with miscellaneous 
shapes and dimensions (fig. S3, A and B). By 
combining these 2D classes for in-depth 3D 
analysis (fig. S4), we obtained a series of 3D 
structures, among which three were identified 
as RESC protein-containing complexes through 
the cryoID approach (26), and their atomic mod- 
els were built. We designated them as RESC-A 
(3.7 A), RESC-B (3.4 A), and RESC-C (3.3 A) (Fig. 
1G, figs. S9 and S10, table S3, and movie S1). 
RESC-A contains RESC1 to RESC6, including 
the gRNA-stabilizing heterodimer RESC1/2. 
RESC-B incorporates RESC5 to RESC11, RESC13, 
and RESC14, thus sharing the RESC5/6 hetero- 
dimer with RESC-A and possessing the mRNA- 
interacting RESC13 (28, 29). The smallest 
complex, RESC-C, consists of RESC5 to RESC8, 
RESC10, and RESC14; it may represent a RESC-B 
assembly intermediate or reflect RESC-B re- 
modeling during editing (Fig. 1G and fig. S8). 
Most RESC components are composed of re- 
petitive arrays of short amphiphilic a helices 
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tagged RESC5 separated 
by means of SDS-PAGE 
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(left), or by glycerol gradient 
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by immunoblot. (E) Mem- 
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stabilizing (PPsome), polyadenylation (KPAC), and KREH2 RNA helicase (KREH2C) complexes in RESC5 isolates from mock- and RNase-treated extracts. Heatmap represents 
og2-fold change values. RNase treatment of the extract prevented RESC5 co-isolation with RECC, PPsome, KPAC, and KREH2C; RESC12 and RESC12A paralogs also 


declined. Supporting mass spectrometry data are provided in table S2. (G) Distinct views of RESC-A (left), RESC-B (middle), and RESC-C (right) cryo-EM density maps. 
(Insets) Models overlayed on transparent cryo-EM density. Additional overlays are provided in fig. S9. 


called HEAT repeats (also known as a solenoid) 
(fig. S11). HEAT repeats often form extended 
superhelical structures and mediate protein- 
protein interactions (30, 31), as observed here. 
Apparent molecular masses of RESC5 com- 
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plexes agree with values calculated from struc- 
tural models; RESC-A (~380 kDa) and RESC-B 
(~600 kDa) likely populate the upper band, 
with the lower band representing RESC-C 
(~270 kDa) (Fig. 1D and fig. S8). 


Depletion of RESC12 and RESC12A from 
RESC5 isolates through RNase treatment (Fig. 
1F) led us to inquire whether these paralogs 
that share 77% protein sequence identity and 
bind to pre-edited mRNA (28, 29) connect with 
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RESC by means of an RNA component (Fig. 1F). 
For RESC12 and RES12A isolations without 
RNase treatment, paralog-specific peptide 
spectra counting showed reciprocal RESC12 
and RESC12A relative abundances and their 
similar levels in RESC5 isolate (table S4). Over- 
all, we conclude that RESC-A and RESC-B in- 
teract with KREH2 helicase and pre-edited 
mRNA binding factors, respectively, through 
RNA. As detailed below, the atomic models of 
RESC-A, RESC-B, and RESC-C reveal interac- 
tions among their subunits, the mechanisms 
of gRNA stabilization, and g2NA-mRNA recog- 
nition. Our analyses also suggest the mechanism 
by which a ribonucleoprotein substrate for the 
catalytic RECC assembles. 


Structure of RESC-A 


RESC-A contains six proteins and has dimen- 
sions of 180 by 115 by 95 A (Figs. 1G and 2A). 
The presence of the RNA triphosphate tunnel 
metalloenzyme (TTM) domain (32) typifies 
RESCI1 and RESC2; the latter also has an N- 
terminal armadillo (ARM) domain. RESC5 
and RESC6 consist of $-propeller and super- 
helical HEAT domains, respectively. RESC3 
and RESC4 are characterized by superhelical 
HEAT domains flanked by an N-terminal an- 
chor and a C-terminal clip domain, respectively 
(fig. S11). RESC1 and RESC2 share a protein 
sequence identity of 31% and form a hetero- 
dimer (Fig. 2B and fig. S13B). A hairpin loop 
(amino acids 181 to 212) protruding from 
RESC2’s TTM runs along the crevice outside 
the triphosphate tunnel of RESC1, yielding 
extensive hydrophobic interactions and main 
chain-main chain hydrogen bonds (Fig. 2B, 
middle and right). Hydrogen bonds between 
RESC2’s ARM and RESC1’s TTM further en- 
hance association (Fig. 2B, top left). Extensive 
RESC1/2 contacts likely underlie their interde- 
pendent persistence in the cell (17). The RESC1/2 
heterodimer occupies the superhelical inner 
surface of RESC6’s HEAT by means of the ex- 
tended loop of RESC2’s ARM, forming hydrogen 
bonds (Fig. 2C). RESC5 associates with RESC6 
through hydrogen bonds and hydrophobic 
contacts (Fig. 2D), generating a positively 
charged crevice between RESC5 and RESC6 
that is potentially receptive to RNA binding 
(Fig. 2E). RESC3 and RESC4 stabilize the ori- 
entations of RESC1/2 and RESC5/6 hetero- 
dimers (Fig. 2A and fig. S12, A and B). 


Guide RNA interactions in RESC-A 


The RESC-A structure described above was 
determined in the RNase-treated isolate (Figs. 
1F and 2A, and figs. S4 and S5). Because mo- 
lecular studies identified RESC1/2 as the key 
gRNA-stabilizing element (J0), we next iso- 
lated RESC5 from mock-treated lysate and 
determined RESC-A structure with gRNA at 
3.7-A resolution (Fig. 3A, figs. S6 and $7, and 
movie S2). The overall structures are similar 
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(Fig. 3B, left), except that disordered loops 
close to gRNA in RESC1 and RESC2 become 
organized and visible in the presence of gRNA 
(Fig. 3B, inset). The triphosphate binding tun- 
nel of RESC2 engulfs the gRNA’s 5’ end while 
the 3’ end docks into the crevice formed by 
RESC5 and RESC6 (Fig. 3, A and C). Specifi- 
cally, the gRNA’s 5’ triphosphate and conserved 
transcription initiation sequence contact the 
triphosphate binding tunnel, whereby the “an- 
chor” and “guiding” regions hybridize into an 
~18-base pair (bp) stem (Fig. 3D) reminiscent 
of gRNA folding in solution (33, 34). The lock- 
ing of anchor and guiding regions into a hairpin 
suggests that gRNA must undergo a drastic 
conformational change to recognize mRNA. 
In addition to the 5 nt at the 3’ end lining up 
the crevice between RESC5/6, a total of 47 nt 
were modeled for gRNA in RESC-A (Fig. 3D). 
Accounting for the unstructured loop region, 
the RESC-bound gRNA matches the median 
49-nt length of encoded gRNA sequences (35). 
Our findings imply that a hairpin conformation 
and 3’-end shielding by RESC5/6 render gRNA 
refractory to 3’-to-5' degradation (4, 36, 37). 
We conclude that RESC-A, rather than solely 
the RESC1/2 heterodimer (0, II, 27), is the 
gRNA-stabilizing particle. 

gRNA is the only class of mitochondrial tran- 
scripts that retains 5’ triphosphate character- 
istic of transcription start site (3, 38), whereas 
the nonencoded 3’ U tail is added by the pro- 
cessome after precursor trimming (4, 5). To 
investigate the functional importance of the 
triphosphate binding by RESC2, we superim- 
posed RESC1 and RESC2 TTM domains with 
Cetl mRNA triphosphatase from Saccharomyces 
cerevisiae (32). Structural comparisons and se- 
quence alignments of 7. brucei and related 
parasite Leishmania major proteins show sub- 
stitutions of two glutamic acids in the catalytic 
metal binding triad (E305-E307-E496; Cetl 
numbering) that typify RESC1 and RESC2 as 
hydrolytically inactive pseudoenzymes (Fig. 3, 
E to G). The lack of phosphatase activities in 
RESC5 isolate and reconstituted RESC1/2 het- 
erodimer support this conclusion (fig. S13). 
However, in the gRNA-occupied RESC2 TTM 
tunnel, positively charged residues (K311, R402, 
and R424; RESC2 numbering) form hydrogen 
bonds with the 5’ triphosphate moiety (Fig. 3, 
D and F). In RESC1, substitutions of potential 
triphosphate binding positions (R393 and 
K456; Cet1 numbering) by acidic or polar side 
chains (Fig. 3G) likely explain the lack of 
gRNA occupancy (Fig. 3A). To test the essen- 
tiality of triphosphate contacts, we constructed 
cell lines for inducible overexpression of RESC2 
mutants and tested their growth kinetics. Re- 
placing E240A/N242A, residues in positions 
structurally homologous to Cetl catalytic metal- 
binding side chains E305/E307, exerted no 
growth inhibition, which is consistent with the 
lack of a Mg”*-binding site in RESC2 (Fig. 3, F 


and G). By comparison, mutations of triphosphate- 
binding amino acids caused growth-inhibition 
phenotypes ranging from mild (K311A) to mod- 
erate (R402A/K406A) to severe (R424A) (Fig. 
3H). As a control, R408A mutation in RESC1 
(corresponding to R424.A in RESC2) produced 
no discernible growth defect (fig. S14). 

The long-debated U-tail function (3, 5, 39, 40) 
compelled a closer scrutiny of the 3'-end bind- 
ing in the RESC5/6 crevice (Figs. 2E and 3, C 
and D). Ultraviolet (UV) irradiation of live par- 
asites and sequencing of RNAs cross-linked 
to RESC1/2 and RESC5/6 (table S5) demon- 
strate that RESC-A shields the entire gRNA 
and indicate that RESC5 binds longer gRNA 
fragments than RESC2 (Fig. 31). The difference 
emanates from RESC5-bound gRNAs possess- 
ing extended U tails (Fig. 3J). Consistent with 
uridylation patterns of predicted gRNAs versus 
gRNA-like molecules encoded by pseudo-gRNA 
genes (35), we found that RESCS functionally _ 
selects gRNAs with longer U tails (Fig. 3K). 
Conversely, the lack of U-enriched motifs in 
RESC5-bound fragments stipulates a “molec- 
ular ruler” mechanism of gRNA selection: The 
length, rather than the sequence of nonen- 
coded additions, delivers the 3’ end into the 
crevice between RESC5 and RESC6 (Fig. 3C). 
Taken together, our structural and functional 
data show that RESC-A selects gRNA by rec- 
ognizing the 5’ triphosphate and accommo- 
dating a defined length and the capacity to 
fold into a hairpin-like conformation. 


mRNA recognition in RESC-B 


We also determined structures of RESC-B from 
RNase- and mock-treated RESC5 isolates. Al- 
though the latter has extended RNA densities 
and additional protein components, their over- 
all structures are similar. The following descrip- 
tion refers to their common features unless 
otherwise stated: RESC-B shares RESC5/6 het- 
erodimer with RESC-A and contains addi- 
tional components RESC7 to RESC14 [Figs. 
1G (RNase-treated) and 4A (mock-treated) and 
movie $3]. Multiple helix-loop-helix structural 
motifs populate subunits RESC7-12 (fig. S11), 
with RESC8 to RESC12 forming extended super- 
helical HEAT domains. RESC13’s N-terminal 
RGG and C-terminal RRM motifs, both impli- 
cated in mRNA binding (40), are present in the 
structure of mock-treated RESC-B; only the RGG 
domain is visible in RNase-treated RESC-B. The 
RESC14 phytanoyl-CoA dioxygenase pseudo- 
enzyme’s phyH domain is wedged between 
RESC6, RESC8, and RESC10 (Fig. 4A and fig. 
$12, C and D). 

Two RNA strands are embedded in RESC-B. 
The shorter fragment contacts RESC5, RESC6, 
and RESC10 (Fig. 4, A and B, and fig. S12C), 
with nucleotides -16 to -12 (we designate po- 
sition -1 as the 3’ end of visible RNA) being 
stacked between RESC5 and RESC6 (Fig. 4D), 
which mimics gRNA binding in RESC-A (Fig. 


3 of 10 


RESEARCH | RESEARCH ARTICLE 


RESC6 / RESC5 


‘hook’ / RESC1 


hydrophobic mainchain 
interaction hydrogen bonds 
ae E RESC6 RESC5 


superhelical 


. crevice 
inner surface 


a so 
180° 180° 
pv 


¥ ye ——$_—_—- VF 
RESC2 / RESC6 / RESC5 


/ 


Fig. 2. Structure of RESC-A protein complex. (A) Perpendicular views of the overall RESC-A assembly. 
(B) Magenta stars point to close-up views of interacting regions in the RESC1/2 heterodimer, detailing 
interactions between the ARM domains of RESC2 and RESC1 (top left) and between the RESC2 “hook” 
domain and RESC1 (right). Yellow patches color hydrophobic interactions with RESC2, presented as a strand, 
and RESC1, shown as a surface. (©) The C-terminal loop of the RESC2 ARM domain engages the inner 
surface of the RESC6 HEAT domain superhelix. Magenta stars point to expanded views of amino acid residues 


in the contacting regions. (D) Side-chain interactions 
potential of the RESC5/6 heterodimer. 


3, C and D). Therefore, this nucleotide strand 
was tentatively designated gRNA. In contrast 
to RESC-A, the 5’ gRNA nucleotides exiting 
from RESC5/6 are not visible in RNase-treated 
RESC-B, whereas the 3’ nucleotides extend fur- 
ther and interact with RESC10 (Fig. 4, A and 
D). Comparison of RESC-A and RESC-B sug- 
gests gRNA transition from a hairpin in RESC-A 
(Fig. 3, B to D) into single-stranded conforma- 
tion in RESC-B (Fig. 4, A, B, and D). The en- 
during gRNA attachment to RESC5/6 during 
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between RESC5 and RESC6. (E) Electrostatic surface 


remodeling renders the stability of this inter- 
action as a quality checkpoint for tunneling of 
properly 3'-processed molecules into the edit- 
ing cascade. 

RNA strand modeling and in vivo RNA- 
protein cross-linking data suggest that mRNA 
fragments account for the longer nucleotide 
segment traversing the entire RESC-B and 
interacting with all proteins (Fig. 4, A, C, E, H, 
and I, and fig. S12, D to F). The unmodeled 5’ 


moiety runs through the interval between the 


RGG and RRM domain of RESC13 and further 
associates with the inner surface of RESC11’s 
HEAT superhelix (Fig. 4, F and G). Heading 
toward the 3’ end, the mRNA is sequentially 
shielded by the superhelical inner surface of 
the HEAT repeats of RESC9, RESC8, and RESC6. 
This shielding is consistent with residual RNA 
moiety in the RNase-treated RESC-B (Fig. 
1G, middle). In RESC-B, the mRNA’s 3’ region 
is positioned in proximity (~30 A) to the 5’ 
end of the single-stranded gRNA fragment 
unwound by RESC5, RESC6, and RESC10 (Fig. 
4B). This mutual orientation potentially al- 
lows gRNA and mRNA to hybridize beyond 
the RESC-B surface. 


gRNA-mRNA duplex scaffolding by RESC-B 


Our cryo-EM structures show gRNA binding 
by RESC1/2 and RESC5/6 in RESC-A, and a 
continued gRNA engagement with RESC5/6 
and additional interactions with RESC10 in | 
RESC-B, all subunits of which also contact 
mRNA. However, sequence diversity of the en- 
dogenous transcripts impeded identification 
by cryo-EM of RNAs bound to each protein. To 
directly identify RNAs bound by individual 
RESC components, we carried out UV cross- 
linking of live parasites and purification of ‘ 
RNA-protein cross-links, RNA fragmentation, 
and sequencing (figs. Sl and S15A and table S5) 
(42). Consistent with our structures, RESC1 
and RESC2 (RESC-A) display strong prefer- 
ence for gRNAs; RESC5, RESC6 (RESC-A and 
RESC-B), and RESC10 (RESC-B) cluster in the 
intermediate range commensurate with dual 
gRNA-mRNA binding modality; RESC-B-specific 
RESC7 to RESC14 predominantly cross-link to 
mRNAs (Fig. 4H). These data confirm the iden- 
tity of the nucleotide strand held by RESC5/6/10 
as gRNA (Fig. 4D) and the strand traversing the 
entire RESC-B as mRNA (Fig. 4A). 

By forming an ~11-bp “anchor” duplex, which 
lengthens as U insertions and deletions extend 
complementarity with mRNA, gRNA initially 
recognizes cognate mRNA. Considering the 
lack of gRNA-mRNA pairing within RESC-B, 
we next assessed editing prevalence in mRNA 
fragments cross-linked to RESC-B proteins. 
The ratios of fully edited sequences to all reads 
derived from pan-edited mRNAs meeting cov- 
erage thresholds, uS12m (Fig. 41, A6, and CO3, 
and from moderately edited CYB mRNA (fig. 
SI15B and table S5), demonstrate that RESC12/ 
12A paralogs and RESC13 display strongest 
preference for pre-edited mRNAs. The extent 
of editing increases in RESC14, whose sup- 
pression spares initiation but blocks editing 
progression (43, 44). As expected from our 
structures, edited sequences are most repre- 
sented among RNAs cross-linked to gRNA- 
mRNA-binding RESC5, RESC6, and RESC10 
(Fig. 41). Longitudinal mapping of RESC5 
and RESC12 (Fig. 4J) and other RESC-B pro- 
teins (fig. S16) in vivo occupancies of uS12m 
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Fig. 3. Structure of RESC-A complex with gRNA. (A) The gRNA binding 
RESC1/2 and RESC5/6 heterodimers and gRNA are highlighted in the overall 
RESC-A structure. (B) Disordered loops in RNA-depleted RESC-A become 
structured and visible with bound gRNA. (C) The 3' terminus of gRNA occupies 
the positively charged crevice formed by RESC5 and RESC6. (D) Interactions 
between gRNA and gRNA-binding proteins. An atomic model was built for the 
first three nucleotides, with the triphosphate group visible on the first nucleotide. 
The rest of gRNA moiety was traced with the main chain. The model overlayed 
on transparent cryo-EM density map was shown for the first nucleotides of 
gRNA (bottom left) and the gRNA stem and the 3' fragment (right). m, median 
length; r, range. (E) Superimposition of triphosphate tunnel metalloenzyme 
(TTM) domains from RESC2 (brown), RESC1 (turquoise), and S. cerevisiae 

RNA triphosphatase Cetl (yellow). (F) Focused views of RNA 5' triphosphate- 
binding and catalytic metal ion—coordination sites in RESC2, RESC1, and Cett. 
(G) Multiple sequence alignment of the TTM domains in RESC2 (R2) and 
RESC1 (R1). Red background depicts identical residues. Conserved residues 


Liu et al., Science 381, eadg4725 (2023) 7 July 2023 


RESC1 RESC2 RESC5 RESC6 


RESC1 RESC2 RESC5 RESC6 RESC1 RESC2 RESC5 RESC6 


are colored in red, and divergent residues are in black. gRNA-binding residues 
in RESC2 are labeled with stars; underlined red stars indicate RESC2 
triphosphate—binding side chains that are divergent in RESC1. Cetl residues 
structurally homologous to RESC2 positions are numbered; blue circles indicate 
catalytic metal-binding triad. (H) Growth kinetics of parasite cell lines that 
conditionally express mutated RESC2 proteins. (Insets) RESC2 expression in 
mock (-tet) cells and cells that were tetracycline-induced for 72 hours (+tet). 
KRET]I, loading control. (1) Size distribution of gRNA fragments cross-linked 

to RESC1, RESC2, RESC5, and RESC6. ****q < 0.0001, Welch's t test followed 
by Benjamini-Hochberg (BH) false discovery rate (FDR) correction. 
(J) Distribution of average U-tail length in gRNAs fragments cross-linked to 
RESC1, RESC2, RESC5, and RESC6. ****q < 0.0001; ns, not significant. Mann- 
Whitney U test, followed by BH FDR correction. (K) Ratios of gRNAs versus 
minicircle-encoded gRNA-like molecules cross-linked to RESC-A subunits. Single- 
letter abbreviations for the amino acid residues are as follows: A, Ala; E, Glu; 

K, Lys; R, Arg; N, Asn 
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Fig. 4. gRNA and mRNA trajectories in RESC-B complex. (A) Overall 

RESC-B structure. (Left) gRNA-interacting proteins RESC5, RESC6, and RESC10 
and gRNA moiety. (Right) mRNA-binding proteins RESC5, RESC6, RESC7, 
RESC8, RESC9, and RESC11 and the mRNA reconstitution. (B) Binding of gRNA 
and mRNA fragments by RESC5, RESC6, and RESC1O. (C) Binding of the 
mRNA's 3' region by RESC5, RESC6, RESC8, and RESC14. (D) Electrostatic 
interactions between gRNA and RESC5, RESC6, and RESC10. Blue surfaces show 
positively charged grooves. (E) Interactions between mRNA and RESC-B 
residues. Close-up views show the detailed interaction of RESC7, RESC9, 

and RESC13 with mRNA positions 101 to 103, and those of RESC6, RESC8, and 
RESC14 with positions 113 to 117. (F and G) Potential architecture of mRNA 
5'-end binding site shaped by RESC11 and RESC13. (F) RESC11 is shown as 
electrostatic surface potential, whereas RESC13 is shown as ribbon. (G) RESC11/ 
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RESC13 are shown as ribbon, whereas unmodeled mRNA is shown as 
transparent gray density. Double-dashed line indicates positively charged 
surface area. (H) In vivo gRNA-mRNA binding preferences of RESC proteins. 
Reads were mapped to annotated gRNAs (35) and to 12 edited mRNA genes in 
all editing states. Read-count ratios are log2-transformed and centered by the 
average of all RESC proteins. (I) Prevalence of fully edited uSl12m (RPS12) 
mRNA sequences detected with in vivo UV cross-linking analysis of RESC-B 
proteins. The ratio of fully edited to combined pre-, partially, and fully edited 
mRNA fragments is plotted for each RESC-B subunit. (J) Positional mapping of 
RESC5 and RESC12 in-vivo binding sites in uS12m (RPS12) mRNA. The x axis 
represents 5'-to-3' mRNA coordinates after removing encoded Us and those 
inserted by editing (T-less). The y axis shows the total read depth for pre-edited 
(blue), U-insertions (orange), and U deletions (green) per nucleotide. 
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mRNA further corroborates their aggregate 
binding preferences ranging from mostly 
pre-edited (RESC11/12/13) to partially edited 
(RESC5/6/10) sequences. 

A gRNA-mRNA duplex protruding from the 
RESC5/6/10 cluster would be expected from 
structural and RNA profiling data. Local 3D 
classification of RESC-B particles indeed shows 
an extended gRNA-mRNA duplex (Fig. 5 and 
fig. S10J). The resultant reconstitutions traced 
a nearly complete 43-nt gRNA exiting from 
RESC-B and forming a 24-bp duplex with the 
3' part of the contiguous 51-nt mRNA fragment 
(Fig. 5A). 

In typical gRNA, the guiding part is sepa- 
rated from the U tail by a few nucleotides that 
cannot pair with mRNA (35). Mechanistically, 
sequestering gRNA’s “information-poor” 3’ 
end by RESC5/6/10 allows guiding and anchor 
parts to hybridize with mRNA beyond RESC-B 
surface, hence, accessible to RECC (Fig. 5C). It 
follows that RESC5/6/10 likely retains the U 
tail and adjacent nonguiding nucleotides as 
editing events take place between the 5’ an- 
chor and gRNA’s 3’ end. Because the mRNA is 
cleaved at each editing site, RESC-B contacts 
with single-stranded gRNA 3’ and mRNA 5’ 
regions likely tether mRNA cleavage fragments 
to gRNA. As documented by “precleaved” edit- 
ing assays, such tethering stimulates U inser- 
tion, U deletion, and RNA ligation activities 


A 3151) 5043) 


gRNA—mRNA 
duplex 
(24bp) 


mee, RESC11 


(45-47). Collectively, our findings introduce 
RESC-B as the probable substrate for the cat- 
alytic RECC complex. Upon completion of edit- 
ing directed by a single gRNA, RESC-B may also 
expose the double-stranded region formed by 
gRNA and fully edited mRNA to postediting 
transactions. 


Discussion 


By using the cryoID approach, we have iden- 
tified and built atomic models of three RNA- 
editing substrate-binding complexes—RESC-A, 
RESC-B, and RESC-C—from T: brucei mitochon- 
drion. Information stored in minicircle and 
maxicircle genomes conflates during U in- 
sertion and deletion editing, whereby gRNAs 
produced from the former direct posttranscrip- 
tional recoding of cryptic transcripts encoded 
in the latter. The RESC molecular machines 
pairing hundreds of gRNAs with their targets 
are key to this broadly important process. In 
RESC-A composed of RESC1 to RESC6, RESC2 
pseudotriphosphatase sequesters gRNA’s 5’ 
end into the triphosphate binding tunnel. Ho- 
mologous binding partner RESC1 attaches 
RESC2 to RESC-A, wherein RESC5 and RESC6 
affix gRNA’s 3’ end by selecting molecules with 
longer U tails. The ensuing hairpin formation 
and protein contacts protect gRNA against 3’- 
to-5' degradation in a plausibly inactive con- 
formation whereby the anchor hybridizes with 


RESC11 


Cc pre-edited mRNA ee 


the guiding region. Such “closed” conforma- 
tion implies that gRNA remodeling precedes 
mRNA recognition. In RESC-B, composed of 
RESC5 to RESC14, we observed an mRNA frag- 
ment traversing the entire complex. An exten- 
sive hydrogen-bonding network fastens mRNA 
to every protein and positions its 3’ part close 
to a single-stranded gRNA segment unwound 
by interactions with RESC5, RESC6, and RESC10. 
Smaller RESC-C also contains gRNA and mRNA; 
whether this abundant particle represents a sta- 
ble intermediate in the transition from RESC-A 
to RESC-B, or reflects RESC-B disassembly dur- 
ing postediting remodeling—e.g., pre-edited 
mRNA moving out of RESC12/12A—remains 
to be established. 

Consistent with our atomic models, in vivo 
RNA-occupancy profiling discerned RESC com- 
ponents into preferential gRNA (RESC1/2), 
dual gRNA-mRNA (RESC5, RESC6, and RESC10), 


and pre-edited mRNA (RESC7 to RESC9 and | 


RESC11 to RESC14) binding factors (Fig. 4H). 
We observed the least editing in mRNA frag- 
ments bound to RESC12 and RESC13, which are 
distal to gRNA in the RESC-B structure, and the 
most in those cross-linked to gRNA-proximal 
RESC5, RESC6, and RESC10 (Fig. 41). The ex- 
tent of U insertions and deletions gradually 
increases from the 5‘ mRNA section seques- 
tered by the RESC12/13 “entry” cluster toward 
the RESC5/6/10 “exit” module. Apparently, the 


fully edited mRNA 


RECC 3' a 
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Fig. 5. Overall architecture of gRNA-mRNA hybrid exiting RESC-B. (A and B) Flipped views of the 24-bp duplex between gRNA and edited mRNA protruding out 
of RESC5, RESC6, RESC8, and RESC10. Three-dimensional (3D) projection of the cryo-EM map in (B) shows the gRNA-mRNA duplex (black star) and putative 
RESC12/12A (dashed circle). This 2D class average shares the same view as the RESC-B structure shown underneath. (€) A model of RNA-editing substrate. 
Mismatches in the duplex between gRNA and pre-edited mRNA delineate U-insertion (+) and U-deletion (—) editing sites (left). Completion of editing events extends 
the double-stranded region initially formed by the anchor through the guiding gRNA parts until full complementarity is achieved (right). 
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RESC-B complex tethers gRNA’s 3’ end and 
pre-edited transcript to facilitate mRNA rec- 
ognition by the anchor. The resultant im- 
perfect mRNA-gRNA duplex protrudes from 
RESC-B surface (Fig. 5), exposing editing sites 
defined by a single gRNA to transiently bind- 
ing and dissociating insertion- and deletion- 
specific RECC complexes (48, 49) and, possibly, 
to postediting transactions. For example, dur- 
ing pan-editing that involves multiple over- 
lapping gRNAs, the initiating gRNA directs 
sequence changes that create a binding site 
for the next one (6). However, initiating gRNA 
must be dislodged from the edited mRNA be- 
fore the next one anneals, and this may involve 
RESC-A and RESC-B cycling or RESC-B re- 
duction into RESC-C. It is also plausible that 
translation initiation would require gRNA 


RESC1 


clearing from the initiation codon created by 
editing (50, 57). 

Because RNA processing complexes typical- 
ly undergo transient rearrangements imple- 
mented by molecular motors, the structures of 
stable particles suggest that RESC-A remodel- 
ing into RESC-B extracts gRNA’s 5’ end from 
the RESC2 triphosphate tunnel and ejects sub- 
units RESC1-4 to unfold the gRNA hairpin. To 
transform RESC-A to RESC-B, the following 
displacements ought to occur on the basis of the 
structural comparison by shared components 
RESC5 and RESC6 (Fig. 6A, left): RESC10 and 
RESC14 will roughly occupy the position of 
RESC2 and RESC1, respectively (Fig. 6A, mid- 
dle); RESC7 and RESC8 will replace RESC3 
and RESC4 (Fig. 6A, right); and RESC2 dis- 
placement yields the superhelical inner surface 


of RESC6, which binds to mRNA’s 5’ segment 
in RESC-B (Fig. 6B). 

Integration of biochemical data and struc- 
tural analyses enables understanding of RESC 
assembly and rearrangements during the edit- 
ing process (Fig. 6C). Separation of RESC2- 
associated complexes detects heterogeneous 
species with apparent molecular masses from 
~280 kDa to ~1.2 MDa (Fig. 1) and demon- 
strates that the most abundant ~280 kDa par- 
ticle lacks gRNAs. This correlates with the 
RESC-A cryo-EM structure, indicating that 
gRNA is sequestered into RESC-A during as- 
sembly from RESC1/2, RESC5/6, and RESC3 
and RESC4 (Fig. 6C, left middle). Our results 
demonstrate a prominent RNA-dependent 
interaction between RESC-A and KREH2 RNA 
helicase, implicating this molecular motor in 


gRNA 

(RESC-B) 
mRNA 

(RESC-B) 


RESC14 
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Fig. 6. Model of transition from RESC-A to RESC-B. (A) Potential clashes resolved by remodeling of RESC-A to RESC-B. RESC5 and RESC6 are well aligned in both 
complexes (left); RESC14/10 in RESC-B replace RESC1/2 in RESC-A (middle), and RESC7/8 in RESC-B replace RESC3/4 in RESC-A (right). RESC9 and RESC12 
positioned in RESC-B midsection do not collide with RESC-A proteins. (B) Comparison of RESC-A and RESC-B highlights the preservation of RESC5/6 (surface) and 
repositioning of gRNA’s single-stranded 3’ end (ribbon). RESC-B proteins are not shown for better visualization. (©) Proposed model of gRNA-stabilizing RESC-A 
assembly and KREH2C-induced remodeling into the editing-competent RESC-B. 
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RESC-A assembly (Fig. 1, A to F). Conversely, 
undetectable KREH2 levels in RESC9 and 
RESC14 complexes suggest that the helicase 
dissociates from RESC-A during remodeling 
into RESC-B (table S1). In RESC-A, gRNA an- 
chor and guiding regions fold into a hairpin, 
whereas the uridylated 3’ end is affixed in the 
crevice between RESC5 and RESC6 (Fig. 6C, 
left middle). A single-stranded 3’ end is re- 
quired for recognition by KREH2, which is 
capable of unwinding double-stranded RNA 
(52, 53) and may unfold the hairpin to ini- 
tiate RESC-A remodeling into RESC-B (Fig. 
6C, right middle). In this scenario, recognition 
by KREH2 may select RESC5/6-bound gRNAs 
with longer U tails (Fig. 3J) as a quality check- 
point of RESC-A assembly. Collectively, our 
findings highlight KREH2 helicase as the prob- 
able remodeler facilitating the transition from 
RESC-A to RESC-B. We postulate that RESC-B 
is the editing-competent substrate of the RECC 
catalytic complex, for which our structures re- 
veal an unobstructed approach path to the 
gRNA-mRNA hybrid. 


Materials and methods summary 
Purification of the T. brucei RESC complex 


For protein affinity purification and enhanced 
in vivo UV cross-linking/affinity purification/ 
RNA sequencing (eCLAP), we introduced a CTS 
tag (combinatorial affinity tag consisting of 10 
histidines, protein C peptide epitope, and Twin- 
Strep peptide) (fig. SIA) at the C-termini of 
established RESC proteins (8). Protein-RNA ad- 
ducts were purified by means of sequential 
streptavidin and metal affinity pulldowns and 
SDS-PAGE. RNA fragments were released 
through protease digestion and sequenced, as 
described in supplementary materials. Protein 
complexes were purified with tandem affinity 
chromatography from enriched mitochondrial 
fraction of procyclic (insect) form of T. brucei 
Lister 427 strain and analyzed as described in 
the supplementary materials. 


EM grid preparation, imaging, data processing, 
and modeling 


Cryo-EM grids of both RNase-treated and 
mock-treated samples were plunge-frozen into 
precooled, liquefied ethane using Vitrobot 
Mark IV, screened with FEI TF20, and imaged 
by using an FEI Titan Krios electron micro- 
scope. Data processing of the RNase-treated 
dataset yielded three final maps with av- 
erage resolution (FSC 0.143) of 3.4, 3.7, and 
3.3 A, respectively. These maps were identi- 
fied through the CryoID approach and mod- 
eled as RESCI1-6-containing RESC-A (3.7 A), 
RESC5-14-containing RESC-B (3.4 A), and 
RESC5-8,10,14-containing RESC-C (3.3 A). In 
combination with the mock-treated RESC-A/ 
B/C, we were able to identify a “closed” gRNA 
in RESC-A and gRNA-mRNA duplex in RESC- 
B/C. Details of EM grid preparation, imaging, 
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data processing, and modeling are described 
in the supplementary materials. 
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STRUCTURAL BIOLOGY 


Principles of human pre-60S biogenesis 


Arnaud Vanden Broeck* and Sebastian Klinge* 


INTRODUCTION: Ribosomes, two-subunit RNA- 
protein nanomachines, translate messenger 
RNA (mRNA) into proteins in all living or- 
ganisms. The small ribosomal subunit (40S) 
is responsible for decoding of mRNA, where- 
as the large subunit (60S) catalyzes peptide 
bond formation. Assembly of both ribosomal 
subunits in human cells is initiated in the nu- 
cleolus, followed by nuclear and cytoplasmic 
maturation, and requires more than 200 ribo- 
some assembly factors catalyzing modifica- 
tion, processing, and folding of ribosomal 
RNA (rRNA). During nucleolar assembly, the 
large ribosomal subunit precursor (pre-60S) 
is assembled from a 5S rRNA and a 32S pre- 
rRNA precursor, the latter containing the 5.8S 
rRNA and the 28S rRNA joined through an 
internal transcribed spacer 2 (ITS2). Because 
of an inability to isolate early endogenous pre- 
60S assembly intermediates, insights into hu- 
man large ribosomal subunit assembly are 
so far limited to very late nuclear states. The 
mechanism underlying nucleolar assembly 
and nuclear maturation of human pre-60S 
particles thus remains unknown. 


RATIONALE: We set out to elucidate the early 
stages of human pre-60S assembly at a mech- 
anistic level. To this end, we have combined 


Principles of human pre-60S biogenesis. 
Human pre-60S assembly pathway with 

crown view of states A to L. During nucleolar 
assembly, states A to C uncover the formation 
of the peptidyl transferase center (PTC) in 
two stages (PTC-I and PTC-Il). Toward nuclear 
maturation, states F to H highlight major 
remodeling events to accommodate the 
immature central protuberance (CP) 
containing the unrotated 5S rRNA. The 
transition from state H to | reveals the 
coupling of 5S rotation to ITS2 processing. 
Late nuclear maturation steps illustrated by 
states J to L include the formation of the 

E site and fine tuning of the CP and PTC. 
Newly folded rRNA elements are color-coded. 
Key assembly factors or ribosomal proteins 
are shown. 
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human genome editing and biochemistry to 
permeabilize human nucleoli and nuclei to 
isolate intact endogenous pre-60S assem- 
bly intermediates for subsequent structural 
characterization by cryo-electron micros- 
copy (cryo-EM). To functionally study rRNA 
processing, we developed an in vivo recom- 
binant ribosome assembly assay using an 
engineered human rDNA locus to investigate 
whether rRNA elements within ITS2 and the 
28S rRNA are required for large ribosomal sub- 
unit biogenesis. 


RESULTS: In this study, we biallelically affinity 
tagged the ITS2-associated assembly factor 
MK67I and determined 24 cryo-EM structures 
of human pre-60S assembly intermediates at 
resolutions of 2.5 to 3.2 A. Within this struc- 
tural landscape, we observed several parallel 
assembly pathways and identified eight main 
nucleolar assembly states and four main nu- 
clear maturation states. In the nucleolus, the 
structures highlight how protein interaction 
hubs tether assembly factor complexes to the 
maturing pre-60S particles and how GTPases 
and ATPases, such as DEAD-box RNA helicases, 
can couple irreversible nucleotide hydrolysis 
steps to the installation of functional cen- 
ters. After exiting the nucleolus, the maturing 


Nucleolus s 


Nucleus 
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pre-60S particles demonstrate how the st ne 
tural plasticity of the pre-rRNA is interrogu.-— 
by an ensemble of assembly factors including 
the rixosome to couple rRNA conformational 
changes with RNA exosome-mediated degra- 
dation of ITS2. Later nuclear pre-60S states 
reveal how assembly factors interchange is 
used to drive irreversible fine tuning of func- 
tional centers. Additionally, by using engineered 
human rRNAs, we have identified elements of 
ITS2 and the 28S rRNA that are critical to gen- 
erating mature large ribosomal subunits in hu- 
man cells. The high resolution of our cryo-EM 
reconstructions further allowed us to visualize 
many previously mapped chemical modifica- 
tions of the human 28S rRNA, providing a 
foundation with which to study both univer- 
sally conserved chemical modifications as well 
as species-specific adaptations as a function of 
ribosome assembly. 


CONCLUSION: Our ensemble of cryo-EM struc- : 
tures, together with our engineered ribosome 
assembly assay, now provide a high-resolution 
perspective on nucleolar assembly and nuclear 
maturation of the human large ribosomal sub- 
unit. Our study rationalizes decades of genetic 
and biochemical data and reveal new princi- ‘ 
ples of eukaryotic ribosome assembly. 
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Principles of human pre-60S biogenesis 


Arnaud Vanden Broeck* and Sebastian Klinge* 


During the early stages of human large ribosomal subunit (60S) biogenesis, an ensemble of assembly 
factors establishes and fine-tunes the essential RNA functional centers of pre-60S particles by an 
unknown mechanism. Here, we report a series of cryo—electron microscopy structures of human 
nucleolar and nuclear pre-60S assembly intermediates at resolutions of 2.5 to 3.2 angstroms. These 
structures show how protein interaction hubs tether assembly factor complexes to nucleolar particles 
and how guanosine triphosphatases and adenosine triphosphatase couple irreversible nucleotide 
hydrolysis steps to the installation of functional centers. Nuclear stages highlight how a conserved RNA- 
processing complex, the rixosome, couples large-scale RNA conformational changes with pre-ribosomal 
RNA processing by the RNA degradation machinery. Our ensemble of human pre-60S particles provides 
a rich foundation with which to elucidate the molecular principles of ribosome formation. 


rotein synthesis in all cells requires 

ribosomes, two-subunit RNA-protein 

molecular machines in which the RNA 

component is responsible for both de- 

coding of messenger RNA within the 
ribosomal small subunit (40S) as well as pep- 
tide bond formation within the ribosomal large 
subunit (60S). Since the discovery of ribosome 
assembly in human cells, >200 ribosome assem- 
bly factors have been identified that catalyze 
the modification, processing, and folding of 
ribosomal RNA (rRNA), initially in the nucle- 
olus and subsequently in the nucleus and cyto- 
plasm (J-3). The large ribosomal subunit rRNAs 
(5S, 5.8S, and 28S rRNA) are generated from two 
transcripts: a transcript containing the 5S rRNA 
and a separate 47S pre-rRNA that contains the 
5.8S and 28S rRNA in addition to the rRNA of 
the small subunit. After the transcription of the 
47S pre-rRNA in the nucleolus and a cleavage 
reaction separating precursors of both sub- 
units, the first stable nucleolar large subunit 
pre-rRNA is a 32S precursor in which the 5.8S 
and 28S rRNAs are joined by internal transcribed 
spacer 2 (ITS2). 

Traditionally, eukaryotic 60S assembly has 
been studied in the yeast system [see (4) and 
references therein and (5-J0)], which has 
offered several molecular snapshots of early 
nucleolar assembly intermediates from Sac- 
charomyces cerevisiae (11-13). However, the 
mechanisms that drive early assembly forward 
remain unknown. The absence of key transition 
states has so far precluded a mechanistic under- 
standing of how essential energy-consuming 
enzymes such as adenosine triphosphatases 
(ATPases), including DEAD/H-box helicases 
and AAA+-ATPases, as well as guanosine triphos- 
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phatases (GTPases) can bring about irreversible 
transitions and the formation of catalytic 
centers of the eukaryotic large ribosomal sub- 
unit, such as the peptidyl transferase center 
(PTC) (14, 15). Thus, the mechanisms that 
enable early eukaryotic large ribosomal sub- 
unit assembly to proceed in a controlled and 
unidirectional manner remain poorly under- 
stood. Toward the end of nucleolar and begin- 
ning of nuclear maturation, pre-60S assembly 
intermediates already contain functional cen- 
ters, including the PTC, the GTPase-associated 
center (GAC), the L1 stalk, as well as a central 
protuberance (CP) containing an unrotated 5S 
ribonucleoprotein (RNP; Nog? particle state 1) 
(16). However, these functional centers have 
not yet reached their mature conformations, 
requiring an ensemble of nuclear ribosome 
assembly factors for their completion. Although 
several structures covering nuclear stages of 
large ribosomal subunit assembly in yeast 
have been described (5-7, 16, 17), major gaps 
in our mechanistic understanding of this pro- 
cess remain. An essential transition during 
nuclear pre-60S maturation is the rotation of 
the 5S RNP during conversion of a Nog2- 
bound particle (76) to a Rix1-Midasin bound 
particle (7), which is catalyzed by Midasin 
(Mdn1). Only after the rotation of the 5S RNP 
does ITS2 processing result in the mature 5.8S 
rRNA, but how these two irreversible events 
are linked has remained unclear. Consistent 
with functional coupling between 5S RNP 
rotation and ITS2 processing, mutations or 
depletions of assembly factors involved in 5S 
RNP rotation or ITS2 processing result in 
the accumulation of either unprocessed pre- 
rRNA or 5.8S precursors containing parts of 
ITS2 (18-22). The accumulation of the same 
pre-RNA species is also observed upon deple- 
tion of the endonuclease cleaving ITS2 (yeast 
Las1, human LASIL) and components of the 
RNA exosome, respectively (23, 24). Although 
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these data suggest coupling of 5S RNP rotation 
with ITS2 cleavage and processing by Las1/LASIL 
and the RNA exosome, the molecular basis of 
this mechanism remains elusive. 

Insights into human large ribosomal sub- 
unit assembly are still in their infancy, because 
only a few structures of very late stages of pre- 
60S maturation have been obtained through 
the ectopic expression of an affinity-tagged nu- 
clear export factor (25). Because of the bio- 
chemical properties of the human nucleolus, 
an inability to obtain endogenous early pre-60S 
assembly intermediates has so far limited our 
understanding of the molecular mechanisms 
responsible for their unidirectional assembly 
and maturation. Furthermore, within the hu- 
man nuclear ribosome assembly pathway, it is 
unclear how an evolutionarily expanded multi- 
protein complex, the rixosome (26), can inte- 
grate different functional activities to ensure 
that cleavage of ITS2 by LASIL (27) and the 
subsequent RNA exosome-mediated process- 
ing of ITS2 are efficiently coupled. 

From an evolutionary perspective, the 32S 
pre-rRNA of the late nucleolar intermediates 
in humans is much larger than its yeast 27S 
counterpart because of the presence of seve- 
ral expansion segments in ITS2 and the 28S 
rRNA, and it is currently unknown if these 
expansions are required for ribosome assembly. 

In this study, we have combined human 
genome editing and biochemistry to permea- 
bilize human nucleoli and nuclei to determine 
high-resolution cryo-electron microscopy (cryo- 
EM) structures of 24 maturing human pre-60S 
particles at resolutions of 2.5 to 3.2 A. Of these, 
eight main nucleolar structures elucidate how 
multiple levels of regulation are used to drive 
unidirectional 60S maturation by integrating 
tethered multiprotein complexes with ATPases, 
such as DEAD-box RNA helicases, and GTPases 
that catalyze key transitions during the assem- 
bly of functional centers. Four main nuclear 
structures reveal the mechanism by which 5S 
RNP rotation is coupled to ITS2 processing 
and how the human rixosome plays a central 
role in mediating these steps. By using engi- 
neered rRNAs in human cells, we have iden- 
tified elements of the human ITS2 and 28S 
rRNA that are critical for ribosome assembly 
and nuclear export. 


Cryo-EM structures of maturing human 
pre-60S particles 


To obtain structural insights into early stages 
of human large ribosomal subunit assembly, 
we have biallelically affinity tagged the endoge- 
nous ITS2-associated assembly factor MK67I 
(fig. S1, A to D). The biochemical affinity puri- 
fication of MK67I enabled the direct isolation 
of human pre-60S particles containing ITS2 
for cryo-EM analysis. These nucleolar and nu- 
clear stages of assembly are associated with 
the expected nucleolar (32S) and nuclear (28S, 
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12S, and 8S) pre-rRNAs (figs. S1, E to I, and 
$2). A total of 172,699 cryo-EM images were 
collected on a Titan Krios microscope equipped 
with a K3 detector. After initial data curation 
and extensive three-dimensional classifica- 
tion, we identified 12 nucleolar stages of as- 
sembly, of which the main eight states are 
here referred to as A to H, and 12 nuclear 
stages of assembly, of which the four main states 
are referred to as I to L (figs. S3 to S5). Focused 
classifications and refinements were used to 
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obtain high-resolution composite maps (2.5 to 
3.2 A) (figs. S6 to S15 and tables S1 and 82). 
These reconstructions enabled the assignment 
and precise model building of human pre-rRNAs 
(5.88, ITS2, 28S, and 5S rRNA), 41 ribosomal 
proteins, and 47 nucleolar or nuclear ribosome 
assembly factors (tables S3 and S4). Differences 
in the pre-rRNA folding states and the presence 
or absence of distinct assembly factors enabled 
the assignment of states A to H and states I to 
L into nucleolar and nuclear assembly path- 
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ways, respectively (Fig. 1, figs. S16 to S19, and 
movie 1). 

We have identified parallel assembly path- 
ways at the beginning of the nucleolar assem- 
bly (states A to D) characterized by the presence 
or absence of the MAK16-associated assembly 
factor complex [MAK16, WDR74 (yeast Nsal), 
RPFI, RRP1 and the GTP-binding domain of 
GNL3 (yeast Nug1)] that are removed by NVL2 
(15) (figs. S16 and S17). We also observed par- 
allel maturation pathways with the stepwise 
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Fig. 1. Nucleolar assembly and nuclear maturation of human pre-60S particles. Simplified human pre-60S assembly pathway with crown view of atomic models 
of states A to L with ribosomal proteins in gray and color-coded pre-rRNAs and ribosome assembly factors. Proteins that became ordered or departed between 


different assembly states are indicated by arrows. 
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maturation of ITS2 elements from states I to 
Lduring nuclear stages of assembly (figs. S18 
and S19). The ITS2-centered events can occur 
concurrently with other maturation events 
that focus on the fine-tuning of functional 
centers of progressing nuclear pre-60S parti- 
cles. These include the cooperative formation 
of the E site (states J), the progressive fine- 
tuning of the PTC through irreversible ex- 
change of assembly factors (states K), and the 
maturation of the P site with the installation 
of uL16 (states L). 

The scale of our cryo-EM dataset, which con- 
tains ~15 million particles, of which ~2.4 million 
particles were chosen, allowed us to observe 
the progression of the human large ribosomal 
subunit assembly as a fine-grained series of 
molecular snapshots that enable a precise 
mechanistic explanation of how ribosome as- 
sembly factors act in concert, the identifica- 
tion of human-specific adaptations, and the 
visualization of covalent modifications of pre- 
rRNA as a function of ribosome assembly. 
This ensemble of states now provides a high- 
resolution perspective on nucleolar assembly 
and nuclear maturation of the human large 
ribosomal subunit as described below. 


High-level organization and hierarchy of 
pre-rRNA and protein incorporation 


A unique principle of human pre-60S assem- 
bly is the early establishment of hubs that 
organize architectural pre-rRNA elements 


28S root helices 


and tether ribosome assembly factor com- 
plexes to coordinate different steps of assembly 
(Fig. 2). 

rRNA root helices, which form the origin of 
each of the six subdomains (I to VI) of the 28S 
rRNA, are already positioned in the first visual- 
ized posttranscriptional assembly state (state A) 
(fig. S20). The early establishment of root helices 
around the assembly factor complex SURF6- 
SSF1-RRP15 assists in the correct positioning 
of each rRNA domain, thereby facilitating sub- 
sequent folding events that occur either within 
a domain or across domains (Fig. 2A). Ribo- 
some assembly factors are not only responsible 
for the correct placement of central architec- 
tural rRNA elements within state A, but also 
prevent premature folding of key rRNA elements 
(figs. S21 and S22). 

Comparable to the organization of central 
elements of pre-rRNA, a protein-protein inter- 
action hub is established in state A near 
domain I, which tethers ribosome assembly 
factor complexes to the maturing particles, 
thereby allowing for progression through sub- 
sequent stages of assembly (Fig. 2B). This protein- 
protein interaction hub is formed around an 
N-terminal region of BOP! to which an extended 
BRX1-EBP2 complex and peptide elements of 
FTSJ3 and RRSI are bound. 

Previous studies have highlighted that eu- 
karyotic ribosome assembly is controlled 
through both the TOR and casein kinase II 
pathways (28, 29). We have identified two 


| Assembly factor hub 


Fig. 2. Early clustering of pre-rRNA and assembly factor modules. (A) Crown view of state A showing 


color-coded root helices of domains | to VI and assem 


bly factors. (B) Crown view of state A highlighting a 


protein-protein interaction hub with color-coded assembly factors. 
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mammalian-specific posttranslational modifi- 
cations in the N-terminal segment of BOP1 
(phosphoserines 126 and 127), which are present 
within a sequence motif resembling casein 
kinase II substrates (30). The central location 
of these modified residues, which are present 
in all states containing BOP! (states A to F), 
raises the possibility that this set of post- 
translational modifications enables modula- 
tion of human ribosome biogenesis within 
cellular metabolism through signaling path- 
ways (fig. S22B). 

During nucleolar assembly stages covering 
states A to F, the protein-protein interaction 
hub (Fig. 2B) performs three major organizing 
functions. First, the presence of a peptide of the 
RNA 2’-O-methyltransferase FTSJ3 (yeast Spb1) 
suggests that tethering enables 2’-O-methylation 
of 28S rRNA as well as recruitment of the NOC2L- 
NOC3L complex and ribosomal proteins at dif- 
ferent stages of assembly (fig. S22B). By in- . 
teracting with all of these proteins, tethered 
ribosome assembly factors and ribosomal pro- 
teins are already present before their involve- 
ment in subsequent stages of assembly. Second, 
the presence of a C-terminal extension of RRS1 
provides a mechanism for recruitment and later 
incorporation of the L1 stalk and the 5S RNP 
through the RRS1-RPF2 complex that binds to 
5S rRNA and ribosomal proteins uL18 (RPL5) 
and uL5 (RPL11) (Fig. 1 and fig. S22D). Third, 
EBP2 has dual functions that include both the 
prevention of premature rRNA folding as well 
as the stabilization of the NSA2-GTPB4 complex 
in state B (Figs. 1 and 2 and figs. S21B and $22C). 

Collectively, the organization of pre-rRNA 
elements and ribosome assembly factor com- 
plexes in state A lays the foundation for many 
of the subsequent irreversible transitions that 
require the presence of already tethered assem- 
bly factors to efficiently stabilize new pre-rRNA 
folding states. 


GTPB4 catalyzes the initial installation of the 
peptidyl transferase center 


Key transitions during ribosome assembly are 
controlled through irreversible steps that in- 
volve nucleotide hydrolysis, RNA cleavage, or 
the generation of force (4, 37). An essential 
family of GTPases conserved in all domains of 
life that acts as a central regulator of large 
ribosomal subunit assembly includes the hu- 
man protein GTPB4 (Nog! in yeast, ObgE in 
Escherichia coli) (16, 32-34). So far, the limit- 
ing resolution of endogenous assembly inter- 
mediates obtained from any species and the 
lack of transition states of maturing particles 
has precluded a precise mechanistic understand- 
ing of the GTPase cycle of GTPB4 and its homo- 
logs during assembly. 

Whereas the C terminus of GTPB4 is already 
bound to eIF6 and RLP24 in state A (Fig. 3A), 
the guanosine diphosphate (GDP)-bound GTPB4 
GTPase domain first appears in state B and 


3 of 12 


RESEARCH | RESEARCH ARTICLE 


HN 
:MRT4 uL6 : 


ie GTPB4 fe 
fa GTPase iq 


Lid: 


State B 


GTPase Associated Center 


Fig. 3. GTPB4 drives early PTC installation. (A and B) Crown views of states A (A) and B (B) showing 
selected assembly factors. Bottom panels show a magnified view in which arrows show flexible elements of 
GTPB4, NSA2, and EBP2 in state A (A) before these are incorporated in state B (B). Assembly factors are 
color coded, and newly integrated PTC rRNA is represented as an orange surface. 


remains GDP bound in all subsequent nucle- 
olar and nuclear states that we observed (Figs. 
land 3B and figs. S23 and S24). Because state 
Bis the first state in which parts of the nascent 
PTC (helices 89 and 91) and the GAC (remod- 
eled ES13 and helices 42 to 44) appear, these 
data indicate that the incorporation of these 
RNA elements is coupled to the GTPase activity 
of GTPB4. The incorporation of GIPB4: after 
GTP hydrolysis is therefore similar to other 
enzymes with ATPase or methyltransferase 
activity (10, 35). Consistent with this model, 
conserved N-terminal residues of NSA2 that 
have been implicated in activating GTPase 
activity in yeast (36) are positioned near the 
GTPase active site (fig. S23B), and dominant- 
negative mutations near the GTPB4 active site 
result in nucleolar accumulation of GTPB4/ 
Nogl in mouse, consistent with trapped par- 
ticles equivalent to state A in which only the 
C-terminal fragment of GTPB4 is bound (32). 

State B represents a key transition state, be- 
cause early assembly factors bridging domains 
I, I, and VI (SURF6-SSF1-RRP15) are still present, 
whereas EBP2 has been remodeled to accom- 
modate helix 72 and stabilize NSA2 and the 
GTPase domain of GTPB4 in the posthydrol- 
ysis state (GDP bound) (Figs. 3B and fig. S23B). 
In state B, crucial rRNA elements around GTPB4, 
such as expansion segment 13 (ES13) and parts 
of helix 89 (H89), adopt immature conforma- 
tions compared with the subsequent state C, 
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in which the LI stalk is incorporated (fig. $23, 
C and D). 

Together, these data allow us to postulate a 
universal mechanism of early GTPase-mediated 
installation of the PTC that rationalizes exist- 
ing data from bacterial and eukaryotic sys- 
tems. In contrast to the prevalent model, in 
which GTP hydrolysis triggers dissociation of 
GTPB4 and its homologs in yeast and E. coli 
during late stages of assembly (33, 34), our 
model postulates that GTP hydrolysis is used to 
dock GTPB4 and its homologs early as a qual- 
ity control step to ensure correct PTC installa- 
tion. However, in the absence of a structure 
of an early GTP-bound state, a late nucleotide 
exchange followed by GTP hydrolysis cannot 
be excluded. 


DDX54 couples irreversible rRNA remodeling 
to PTC assembly 


After the GTPase-mediated partial installation 
of the PTC, the presence of GTPB4 in state B 
provides a unique binding site for the conserved 
DEAD-box RNA helicase DDX54 (Dbp10 in 
yeast) that has been implicated in PTC matu- 
ration downstream of GTPB4 (37-39). In addi- 
tion to the two characteristic RecA lobes shared 
by all DEAD-box RNA helicases, DDX54 con- 
tains a structured C-terminal region (Elbow), 
N- and C-terminal extensions (NTEs and CTEs), 
as well as a C-terminal tail, providing substrate 
specificity (9). We found that DDX54 is incor- 
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porated into state C in a posthydrolysis state and 
remains ADP bound in states C to F (Figs. 1 and 
4 and fig. S24). 

Our structures illustrate that DDX54 has a 
pivotal role during nucleolar large ribosomal 
subunit assembly by performing three essen- 
tial functions. First, DDX54 is responsible for 
the formation of functional centers such as the 
PTC and the LI stalk. Second, DDX54: provides 
directionality to 60S maturation through re- 
modeling of the pre-rRNA root helices such 
that early assembly factors can dissociate. Third, 
DDX54 acts as a recruitment platform for new 
assembly factors that catalyze downstream 
events (Fig. 4 and fig. $24). 

In its first role during the formation of func- 
tional centers, the association of DDX54 with 
GTPB4: limits conformational freedom of the 
remaining elements of the PTC (H90, H92, and 
H93) and RNA elements that accompany the 


NOP2/NIP7 complex, including the Li stalk and 


part of the E site. By acting as a clamp, DDX54 
is thus able to facilitate PTC formation, re- 
maining bound on top of H90 and H92 and 
fixing the L1 stalk in an immature conforma- 
tion, consistent with prior biochemical data 
(38) (Fig. 4, A and B, and fig. S245). The instal- 
lation of elements of the PTC is therefore cat- 
alyzed through checkpoints that are controlled 
by energy-consuming enzymes in two steps, 
with GTPB4 installing H89 and H91 and DDX54 
stabilizing H90, H92, and H93. 

In its second role, DDX54 remodels the root 
helices, first of domain IV, helix 61 (H61), and 
subsequently domain III (H47), triggering a 
series of irreversible conformational changes 
that provide directionality to 60S assembly 
(Fig. 4, A and B, and fig. S24D). In state B, root 
helices of domains III and IV are trapped by 
the SURF6-RRP15-SSF1 complex (Fig. 4A). In 
state C, DDX54 has successfully remodeled H61, 
stabilizing an extended conformation of H61 
that abolishes the binding site for the SURF6- 
RRPI15-SSF1 complex (Fig. 4B and fig. S24D). 
The unidirectionality of the removal of these 
proteins is further guaranteed by a FTSJ3 C- 
terminal segment that binds the site previously 
occupied by SURF6, thereby preventing reas- 
sociation of the state A- or B-specific SURF6- 
RRPI15-SSF1 complex while additionally blocking 
the nascent polypeptide exit tunnel together 
with RPF1 (Fig. 4, A and B, and fig. S24D). As a 
result of the release of the SURF6-RRP15-SSF1 
complex, which previously maintained domain 
VI in an immature conformation, a compac- 
tion occurs between domains V and VI, creat- 
ing a binding site for LLPH at the solvent side 
(Fig. 4B). The remodeling of H61 further in- 
volves alternate base pairing in the extension 
of the root helix of domain IV, thereby repo- 
sitioning H64 and accommodating the newly 
integrated PTC elements H90 and H92. The 
specific alternate base pairing on top of H61 
is further interrogated by DDX54 through a 
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Fig. 4. DDX54 integrates key transitions during early nucleolar assembly. (A to C) Crown views of states B (A), C (B), and D (C) showing selected assembly 
factors as they enter the respective states. Magnified views below provide more detail of the environment present before (A), during (B), and after the installation of 
DDX54 (C). (D) Schematic indicating the domain organization of DDX54 and interactions with H61 of the 28S rRNA. The cartoon representation of DDX54 below 
shows two views of its interactions with a remodeled H61. A flipped-out base (G3839) is indicated. (E) Schematics of remodeled H61 as seen in states C to F (left) and 
the mature form of H61 as seen in state G and subsequent states (right). 


flipped-out nucleotide (G3839) so that only 
the correct substrate can be accommodated 
near the PTC (Fig. 4, D and E, and fig. $24E). 
In its third role as a recruitment platform, the 
C- and N-terminal extensions of DDX54: se- 
lectively recruit NOP2-NIP7 in state C and the 
FTSJ3-NOC2L-NOC3L assembly factor com- 
plex in state D, respectively (Fig. 4, B and C, 
and fig. S24, E and F). In state D, the NOC2L- 
NOC3L complex is positioned in a downward 
orientation, thereby preventing premature 
maturation of domain III (Figs. 1 and 4C). 
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The ability of DDX54 to interact with NOC2L- 
NOC3L and the methyltransferase domain of 
FTSJ3 through the DDX54 C-terminal tail al- 
lows DDX54 to organize elements of domain 
III that include the domain II root helix H47 
as well as helices H48 and H49 (Fig. 4C and 
fig. S24, E and F). In this context, the methyl- 
transferase domain of FTSJ3 performs a sec- 
ondary structural role, whereas its substrate 
for 2’-O-methylation (nucleotide 4499 within 
the A-loop/H92) is already deeply buried un- 
derneath DDX54 (fig. S24E). The ability to 
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tether FTSJ3 already in state A (Fig. 2B) is 
compatible with the modification of nucleotide 
4499 before state G when FTSJ3 is removed. 
The structural changes observed during the 
transitions from state B to D highlight how an 
ensemble of factors centered around the DEAD- 
box RNA helicase DDX54 remodels important 
segments of pre-rRNA. Beyond enzymes such as 
DDX54 and FTSJ3, which act directly upon a 
substrate, these also include proteins that fulfill 
either architectural roles such as NOC2L-NOC3L 
or stabilizing roles such as the N terminus of 
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GNL3 that stabilizes ES13 in state C specifically 
(Fig. 4 and fig. S23C). These findings support 
a model for the cooperative actions of these 
factors that is required to advance 60S assem- 
bly, further rationalizing prior genetic inter- 
actions observed in yeast among Dbp10, Noc3, 
and Nug! (39) (Fig. 4). 


Transitions toward nuclear maturation 


After a rotation of the NOC2L-NOC3L complex 
from a downward (state D) into an upward 
position that is accompanied by the removal 
of MAK16-assembly factors at the solvent- 
exposed side (state E; Fig. 1 and fig. S25), we 
observed that a trio of WD40 repeat proteins, 
BOP1, WDR12, and WDR55 (yeast Erb1, Ytm1, 
and Jip5) chaperones the formation of domain 
III in state F (figs. S25 and S26). Subsequently, 
the transition from state F to G is marked by 
the MDNI1-mediated removal of a nucleolar 
protein interaction network. In state G, DDX54 
is replaced by a peptide of the GTPase NOG2 
that binds to GTPB4 (Fig. 1 and fig. S27). The 
initial tethering of NOG2 is followed by the 
installation of the 5S RNP in a nonrotated state 
in state H, for which we have obtained a partial 
reconstruction (Fig. 1 and fig. S27). For a more 
detailed description of these transitions, see 
the supplementary text. 


The rixosome and NOP53 couple 5S RNP 
rotation to ITS2 processing 


After nucleolar assembly of the human pre-60S, 
a critical event during the subsequent nuclear 
maturation of pre-60S precursors is the MDN1- 
catalyzed rotation of the 5S RNP from the un- 
rotated to the rotated state, which results in the 
installation of the CP (7, 15, 16, 18, 40-43). 

The structure of state I shows that the for- 
mation of this intermediate, in which the 5S 
RNP is rotated, is the product of several hier- 
archical and coupled steps (Fig. 5). Only upon 
correct rotation of the 5S RNP and the de- 
parture of early assembly factors (RPF2 and 
RRSI) can the assembly factor SDA1 and the 
rixosome stably associate, thereby stabilizing 
the L1 stalk in a bent conformation (Fig. 5). 
Association of the rixosome is required for 
ITS2 processing on two levels: The rixosome- 
associated endonuclease LASIL (6, 22) initiates 
cleavage of ITS2, whereas the bent confor- 
mation of the L1 stalk, induced by rixosome 
stabilization, serves as a binding site for a pep- 
tide of NOP53 that brings the RNA exosome 
into close proximity to its ITS2 substrate (Fig. 5 
and fig. S28). Although the RNA exosome and 
some components of the rixosome (MDNI1, 
SENP3, and NOLY) could not be visualized in 
our reconstructions, most likely because of 
flexibility, they are present in our mass spec- 
trometry data (data S1). 

Although 5S RNP rotation and bending of 
the LI stalk have been observed in the yeast 
system, limiting resolution near ITS2 has so 
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far precluded a mechanistic understanding of 
how these large-scale conformational changes 
are coupled to processing of ITS2 by the RNA 
exosome (7, 17, 40). Our high-resolution struc- 
tures of state I explain how the bent confor- 
mation of the L1 stalk and the presence of 
ITS2 are specifically interrogated by 200 ami- 
no acids of a universally conserved N-terminal 
segment of NOP53 (sensor and anchor do- 
mains), which had not previously been visual- 
ized (Fig. 5B and fig. S28). The involvement 
of the NOP53 sensor domain is supported 
by biochemical data showing that N-terminal 
truncations of this domain result in ITS2- 
processing defects (44). 

The NOP53 sensor domain is only able to bind 
the L1 stalk in the bent conformation, binding 
in such a way that in the presence of ITS2 and 
associated assembly factors, the NOP53 an- 
chor domain can additionally wrap around 
ITS2. The joint binding of the sensor and an- 
chor domains positions a recruitment pep- 
tide of the RNA exosome, the NOP53 AIM 
motif (45), in very close proximity to ITS2. 
Because the AIM motif has been shown to 
interact with the RNA exosome-associated 
RNA helicase MTR4 (46, 47), our structural data 
now provide key insights into the coupling of 5S 
RNP rotation, L1 stalk conformation, and ITS2 
processing. 


A 
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A 
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The structure of state I now rationalizes how 
defects or depletions of assembly factors in- 
volved in 5S RNP rotation and the installation 
of the rixosome result in the accumulation of 
unprocessed pre-rRNA species (78-22). In the 
absence of a stabilized bent L1 stalk, either no 
cleavage of ITS2 occurs or in cases where ITS2 
cleavage does occur, the ITS2 precursor cannot 
be efficiently processed by the RNA exosome 
because it is not positioned close to its sub- 
strate by NOP53 (28). This model also explains 
how the NOP53 anchor domain can remain 
associated with the ITS2 region during exosome- 
mediated ITS2 processing (17). 


Functional integration of the human rixosome 


State I further illustrates how the human rixosome 
and NOG2 function through an evolutionarily 
expanded protein-protein interaction network 
that is formed through CTEs of the assembly 
factors NOG2, TEX10, and LASIL, which have 
no counterparts in yeast (Fig. 5A and fig. $29). 
These CTEs facilitate three integrated func- 
tions during nuclear pre-60S maturation. 
First, extensive interactions of CTEs with NOG2 
allow the human rixosome to already associate 
with late nucleolar assembly through a pep- 
tide of NOG2, as revealed in states G and H 
(Fig. 1 and fig. $27) (0). Second, the CTE of 
NOGQ2 is also ensuring directionality during the 


120° 


Fig. 5. State-specific processing of ITS2. (A and B) Two views of state | showing selected assembly 
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factors involved in ITS2 processing. Bottom panels show a magnified view of the rixosome and its interactions 
with the pre-60S particle (A) and the NOP53 N terminus near the L1 stalk (B). The NOP53 sensor and anchor 
domains (red and orange, respectively) are shown in proximity to the AIM motif. 
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transition from nucleolar to nuclear assembly 
by probing the environment around eL34, 
where it has a mutually exclusive binding site 
with the nucleolar assembly factor FTSJ3 (yeast 
Spbl) (Fig. 5A and figs. S22B and S29D). Third, 
the large C-terminal extension of TEX10 pro- 
vides binding sites for NOG2, as well as several 
rixosome components known as the RIX] sub- 
complex (comprising two copies each of PELP1 
and WDR18) (48) and the C-terminal extension 
of the endonuclease LASIL, thereby directly 
recruiting the ITS2 endonuclease to state I (Fig. 
5A and fig. S29D). In the context of the human 
rixosome, CTEs are used to flexibly tether dif- 
ferent enzymatic activities, with LASIL, NOL9, 
and SENP3 acting as functional modules with 
endonuclease, 5’ polynucleotide kinase, and 
SUMO-specific protease activity, respectively 
(fig. S29, C and D) (27, 48, 49). 

Together, our structural data on state I high- 
light specific aspects of human ribosome as- 
sembly in which the rixosome has evolved into 
a higher-order protein interaction hub that 
directly couples 5S RNP rotation and L1 stalk 
bending to the recruitment of the endonucle- 
ase LASLI to initiate cleavage of ITS2 for sub- 
sequent exosome-mediated RNA processing 
(Fig. 5 and figs. S28 and S29). 


Processing of recombinant pre-rRNA 
in human cells 


Human ITS2 is five-fold larger than that of 
yeast. Although available biochemical data sug- 
gest that there are more pre-rRNA-processing 
sites in human cells (50), it remains unclear 
how ITS2 processing is achieved and which of 
its RNA elements are essential for cleavage 
and ribosome assembly. Similarly, it is unclear 
whether large RNA expansion segments of the 
human 28S rRNA are also required for ribosome 
assembly. By combining our structural data with 
a human large ribosomal subunit assembly as- 
say, we have determined the order of ITS2 re- 
moval and identified key regions of pre-rRNA 
that are required for LSU maturation (Fig. 6). 

Similar to what has been observed for the 5’ 
external transcribed spacer in the context of 
the human SSU processome (57), only a very 
small portion of ITS2 can be visualized in all 
pre-60S assembly intermediates (fig. S30). In 
state I1, only helices 1 and 2 of ITS2 are visible 
and are positioned close to the ITS2-associated 
assembly factors MK67I, RLP7, PES1, and RLID1 
(in yeast Nop15, Rlp7, Nop’, and Cicl, respective- 
ly) (Fig. 6A and fig. S30). States 11 to I3 represent 
different stages of ITS2 processing and disas- 
sembly of the associated protein factors, with 
elements of ITS2, RLID1, and the NOP53 an- 
chor domain disappearing in state I2 (Fig. 6B). 
The transition from state I2 to I13 is marked by 
the disordering of the remaining N-terminal 
segments of NOP53 (sensor and anchor do- 
mains) and of the structured domain of RLP7 
that is associated with MK67I (Fig. 6, B and C). 
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The minimal visualized region of ITS2 within 
the pre-60S particles prompted us to investi- 
gate whether disordered elements of ITS2 and 
expansion segments within human 28S rRNA 
are required for ribosome assembly and pre- 
rRNA processing by designing truncations in 
these regions and testing for successful ribo- 
some assembly (Fig. 6, D to F, and fig. S30, B 
and C). Whereas truncations of ITS2 were based 
on its predicted secondary structure (fig. S30B), 
28S rRNA truncations were focused on large 
expansion segments that were not visualized 
in any of our human pre-60S assembly interme- 
diates (Fig. 6D). All truncations were inserted 
into a plasmid containing an engineered human 
rDNA that, in addition to the indicated trunca- 
tions, includes unique 28S and ITS2 probes for 
Northern blotting, specifically detecting the 
formation of mature recombinant large ribo- 
somal subunits, as well as its ITS2-containing 
precursors (Fig. 6D and fig. S30, B and C). 

Within ITS2, long helices were selected for 
truncations that either lacked known cleavage 
sites (Al and A2) or contained the predicted 
LASIL cleavage site responsible for generating 
a 12S species (52) (A3), as well as combinations 
of these (A1,2 and A1,2,3). We further designed 
ITS2 variants that lacked the LASIL cleavage 
site and contained RNA segments just ex- 
ceeding what is visible in our structures (mini 
and micro; Fig. 6D and fig. S30, B and C). With- 
in the 28S, rRNA truncations were designed to 
eliminate disordered expansion segments ES7ab 
and ES15 (Aa), ES27 (Ab), or all three of these 
(Aa,b). The resulting truncations reduced ITS2 
from five times the length of yeast ITS2 to 
~60% of the length. Similarly, the combined 
truncations of expansion segments of the 28S 
rRNA resulted in a 28S rRNA that is only 20% 
larger than its yeast counterpart (Fig. 6E). 

To investigate the effects of these truncations, 
the designed variants were transfected into hu- 
man embryonic kidney (HEK)-293F cells from 
which cytoplasmic and nuclear extracts were 
obtained to identify engineered mature ribo- 
somes that were exported into the cytoplasm, 
as well as their corresponding nucleolar and 
nuclear precursors (Fig. 6F and fig. S31). We 
found that ITS2 truncations that removed 
long helices without the predicted LASIL cleav- 
age site (Al, A2, and A1,2) were processed sim- 
ilarly to the wild type without accumulation of 
a nucleolar precursor species (32S). By contrast, 
the elimination of the predicted LASIL cleavage 
site (A3 or A1,2,3) resulted in the accumulation 
of a 32S species in the nucleus. Despite this, 
mature ribosomes containing either 28S or 
32S pre-rRNA were detected in the cytoplasm 
for A3 or A1,2,3 truncations. These data sug- 
gest that either redundant enzymes or cleav- 
age sites exist to trigger ITS2 removal, and 
that despite the absence of a regular LASIL 
cleavage site, some precursors are exported. 
These observations are consistent with what 
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has been observed in yeast in the absence of 
Las1, where ribosomes containing ITS2 are 
exported into the cytoplasm and turned over 
(53-55) (Fig. 6F). The strongest accumula- 
tion of precursors was observed with minimal 
variants of ITS2 (mini and micro truncations), 
for which no processing of ITS2 was observed 
and the resulting cytoplasmic ribosomes con- 
tained ITS2 (Fig. 6F). For truncations of ex- 
pansion segments within the 28S pre-rRNA, 
no processing defects were observed compared 
with the wild type (Fig. 6F). Therefore, large 
parts of ITS2 and the 28S expansion segments 
ES7ab, ES15, and ES27 are dispensable for ribo- 
some assembly in cis in human cells, with the 
exception of the regions of ITS2 including 
the pre-60S associated helices and the LAS1L 
cleavage site, the removal of which results in 
partial processing defects. 


Transitions preceding nuclear export 


After the departure of the human rixosome, 
large-scale RNA conformational changes, to- 
gether with eukaryote-specific rRNA and pro- 
tein elements, drive the maturation of the central 
protuberance and the formation of the human 
E site (figs. S32 and S33). Subsequently, we 
observed states in which the late nuclear as- 
sembly factor TMA16, together with the newly 
identified assembly factor L10K, triggered the 
removal of PTC-proximal assembly factors 
such as GTPB4, NOG2, and NSA2 (fig. S34). 
For a more detailed description of these states, 
see the supplementary text. 


Chemical modifications of pre-RNA during 
human pre-60S assembly 


Beyond our insights into the role of numerous 
human nuclear ribosome assembly factors, 
the high resolution of our cryo-EM reconstruc- 
tions of nuclear pre-60S assembly intermedi- 
ates (states I to L) also allowed us to introduce 
and confirm the positions of many previously 
mapped covalent modifications of the human 
28S rRNA (56, 57). These include modifica- 
tions introduced through proteins that are 
guided by small nucleolar RNAs (snoRNAs) 
base pairing with RNA regions surrounding 
their substrates (2'-O-methylations and pseudo- 
uridylations), as well as modifications intro- 
duced through protein factors directly (m‘A, 
m°U, m°C, and m°A nucleobase modifications 
and a 2’-O-methylation) (58-63) (fig. S35). Dur- 
ing the assembly of the human pre-60S, the 
introduction of every chemical modification 
will require local chaperoning of substrate 
RNA, so any introduced modification is a di- 
rect representation of a prior RNA chaperon- 
ing event. By mapping the observed chemical 
modifications onto key nucleolar (states A to 
H) and nuclear assembly intermediates (states 
I to L) of our human pre-60S structures, we 
could now observe the order in which RNA ele- 
ments appear and place chemical modifications 
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relation to an engineered human rDNA locus. Magnified sections indicate 
the position of ITS2 variants (bottom left) and designed truncations of expansion 
segments ES7ab and ES15 (Aa) or ES27 (Ab) (bottom right). The locations of 


in the context of ribosome assembly (Fig. 7). This 
analysis highlights four central concepts for 
human pre-60S assembly, as described below. 

First, during nucleolar stages, domains I to 
VI assemble in an order similar to what has 
been previously observed in yeast (1-13), with 
domains I, II, and VI folding first before do- 
mains V, III, and IV are integrated (Fig. 7A, 
fig. S36, and data S2). Second, already in state 
A, all root helices, which are the origin of the 
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extracts (bottom). 


six 28S rRNA subdomains, are visible. (fig. 
$20). A high density of chemical modifica- 
tions is observed near or within these root 
helices, suggesting important prior RNA chap- 
eroning and thus highly controlled RNA fold- 
ing before the formation of state A (Fig. 7, A 
and B). Third, consistent with the notion that 
the presence of a chemical modification re- 
quires prior RNA chaperoning, we found that 
regions that are heavily modified form late 
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bottom) for both nuclear (N, yellow) and cytoplasmic (C, blue) fractions. 
Western blots for TopollB highlight the separation of nuclear and cytoplasmic 


during assembly, such as elements of domains 
V and IV. These regions include the PTC that 
is installed in two steps through the GTPase 
GTPB4 in state B (PTC-I; Fig. 7B) and the DEAD- 
box RNA helicase DDX54 in state C (PTC-II; 
Fig. 7B). In the context of ribosome assembly, 
the presence of a modified nucleotide can 
therefore be seen as a means of either delaying 
folding in a particular region or preventing 
misfolding. This logic further implies that 
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Fig. 7. Chemical modifications A 
of human 28S rRNA during 
assembly. (A) Schematic 
domain organization of the six 
color-coded domains (I to VI) of 
the human 28S rRNA. Root 
helices are indicated as darker 
blocks. Below the schematic, 
visible RNA elements are 
indicated for different states of 
the human pre-60S assembly 
pathway in the nucleolus (states 
A, B, C, and F) and nucleus 
(states | and L). Chemical mod- 
ifications are shown in gray 

or red as indicated. The 
positions of selected functional 
centers (PTC, GAC, and L1 
stalk) and RNA expansion 
segments (ES7, ES15, and ES27) 
are shown. (B) RNA structures 
of assembly intermediates 
shown in (A) with newly folded 
RNA elements color coded. 
Root helices and functional 
centers (GAC, PTC, and L1 stalk) 
are indicated. Chemical modifi- 
cations are represented as 
spheres. 
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regions that need to fold quickly at the begin- 
ning of assembly, such as domain I, should have 
a low density of chemical modifications, which 
is indeed what we observed. Fourth, regions that 
are not conserved, disordered in our reconstruc- 
tions, or not subject to extensive RNA remodel- 
ing should lack chemical modifications. This is 
precisely what we observed for expansion seg- 
ments such as ES7, ES15, and ES27, which our 
functional studies show to be dispensable for 
ribosome assembly (Figs. 6 and 7A). 

The mapping of human chemical RNA mod- 
ifications as a function of ribosome assembly 
in the nucleolus and nucleus provides a foun- 
dation with which to study both universally 
conserved chemical modifications and species- 
specific adaptations. 


Discussion 


During the nucleolar and nuclear stages of pre- 
60S assembly, functional centers need to be first 


Vanden Broeck et al., Science 381, eadh3892 (2023) 


28S domains d 
Root helices 
rRNA elements 


I 1312 ll 2363 III 2826 IV 3862 V 4572 VI 5069 
il I 
ES7aES7b ES7c ! GACES15 ! ES27a ES27b "Li stalk PTC! 
! (Ci i i a) \ 


assembled and subsequently matured. On the 
basis of our structural data, we now propose a 
simplified model for human nucleolar and nu- 
clear pre-60S biogenesis (Fig. 8) that comple- 
ments all prior work in S. cerevisiae (fig. S37). 
We highlight that emerging key nucleolar themes 
include high-level organization and hierarchy 
at the pre-rRNA and protein level (Figs. 2 and 
8), the use of energy-consuming enzymes such 
as GTPB4 and DDX54 to drive transitions, and 
the installation of functional centers in a uni- 
directional manner (Figs. 3, 4, and 8). Critical 
nuclear themes include recognition of pre- 
rRNA for LAS1L-mediated cleavage, followed 
by RNA exosome-mediated processing of ITS2 
and how the human rixosome can integrate 
these processing steps through C-terminal pro- 
tein extensions (Figs. 5 and 8). 

Beyond representing a near-complete struc- 
tural inventory of nucleolar and nuclear hu- 
man pre-60S assembly, our data now start to 
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interweave human ribosome assembly with re- 
search fields that have so far appeared discon- 
nected at a mechanistic level. The observation 
of posttranslational modifications of ribosome 
assembly factors as observed for BOP! (Fig. 2 
and fig. S22B) suggest a direct mechanistic role 
in the molecular control of human ribosome 
assembly and indicates how signaling pathways 
may integrate human ribosome assembly with 
cellular metabolism. In addition, our work lays 
the foundation for a mechanistic understand- 
ing of human diseases so that the molecular 
effects of point mutations in ribosomal pro- 
teins can be put into a temporal context of 
large ribosomal subunit assembly in patients 
suffering from a range of ribosomopathies 
(64-66). 

At the nuclear level, the structural basis for 
the coupling between 5S RNP rotation and 
ITS2 processing, as shown in state I (Fig. 5), 
highlights the importance of structural studies 


9 of 12 


& 


RESEARCH | RESEARCH ARTICLE 


Nucleolar assembly 


(Installation of functional centers - Part I (PTC & GAC) } (Organization of Domain Ill 28S rRNA) 


{Installation of functional centers - Part Il (PTC & L1-stalk)] 


(Removal of MAK16-associated factors) 


(MDN1-mediated irreversible restructuring ) Wu WOR74 RRP1 
Rv MAK16 RPFt 
i { » GNL3-GTPase 
Rotation f Sa 
% 
‘ c (Ltstaik) p 
‘ VL y oe rotation 


— VQ, 
ea € 
CF \ 


‘| Early nucleolar 
assembly factors 


(Chaperone-aided integration of Domain Ill 28S rRNA) —_\\ucleolus 


Nucleus 


(Installation of functional centers - Part Ill (CP)] 


(MDN1-mediated irreversible restructuring } 
{Maturation of functional centers-PartI(CP&E-site)} ptr nnn nnn nnn enn 


{Installation of functional centers - Part IV (CP rotation)} 


State L 


(Stepwise disassembly of ITS2 module) 


o 


ee | 


og 
o 


{ Maturation of functional centers - Part Il (PTC & P-site) | 


Fig. 8. Model for human pre-60S biogenesis in the nucleolus and nucleus. critical maturation steps such as ITS2 cleavage by LASIL (red highlight), 
Simplified schematic model of human large ribosomal subunit biogenesis in the | subsequent stepwise disassembly of ITS2 module by the RNA exosome (orange 
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with endogenous complexes. Further, it un- | Lasl cleavage and Nop53-mediated recruit- Across nucleolar and nuclear assembly, our 


derlines the complexity of eukaryotic ribo- | ment of the RNA exosome, events that cannot | analysis of chemical modifications of RNA in- 
some assembly in which MDN1-induced 5S | be reconstituted by using only enzymes in- | dicates that chemical modifications can be 
RNP rotation precedes ITS2 processing with | volved in ITS2 processing (/7, 67). used as a readout of prior RNA chaperoning 
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during the assembly of human pre-60S sub- 
units (Fig. 7). This notion is in agreement with 
additional functions that have been described 
for chemical modifications. First, chemical mod- 
ifications in yeast and human cells may also 
serve as checkpoints during assembly, where 
they may be interrogated by assembly factors 
(fig. S29B) (10). Second, although chemical 
modifications can result in additional stabi- 
lization of the ribosome, complete ribosomes 
can be formed even in the presence of cata- 
lytically dead enzymes involved in pseudo- 
uridylation while the conformational freedom 
of subunits including intersubunit rotation is 
partially affected (68). Third, the heterogeneity 
of chemical modifications as observed in the 
human ribosome (69) can be explained as the 
result of several redundant sites of chemical 
modifications that together guarantee effi- 
cient ribosome assembly without the need for 
selective and quantitative modification. These 
data suggest that in archaeal and eukaryotic 
systems, in parallel to proteinaceous assembly 
factors, smoRNA-mediated modifications have 
evolved as a second layer of control to ensure 
the productive biogenesis of ribosomes. 


Methods summary 


Full details of the material and methods can 
be found in the supplementary materials. In 
brief, CRISPR/Cas9-based genome editing was 
used to generate a biallelically tagged MK67I- 
GFP cell line from which human pre-60S as- 
sembly intermediates were affinity purified 
and structurally characterized using cryo-EM. 
In addition, an in vivo recombinant ribosome 
assembly assay using an engineered human 
rDNA locus was used to investigate whether 
rRNA elements within ITS2 and the 28S rRNA 
are required for the large ribosomal subunit 
biogenesis. 


MK67I-GFP** cell line generation 


Two DNA repair templates were designed, each 
encoding a GFP tag followed by a surface dis- 
play sequence. These templates were cloned 
into a vector along with a plasmid expressing 
the single-guide RNA and the Cas9 enzyme. 
The DNA mixture was transfected into HEK- 
293F cells, which were then allowed to recover 
and expand. Cells expressing GFP were selected 
using fluorescence-assisted cell sorting, and 
then cells positive for both surface display epi- 
topes were further selected. Single-cell clones 
were expanded and genotyped to confirm bi- 
allelic editing. 


Purification of human pre-60S 
assembly intermediates 


For the purification of human pre-60S assem- 
bly intermediates, the MK67I-GFP cell line 
was grown in suspension cultures. Cells were 
harvested and nuclei were extracted from the 
cells. Intact nuclei were resuspended in sol- 
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ubilization buffer, and the lysate was spun down 
to remove the insoluble fraction. The super- 
natant was incubated with anti-GFP nanobody 
beads. The complexes were eluted using 3C- 
protease cleavage. Purified pre-60S particles 
were used for further analysis. 


Recombinant human rDNA engineering 


A full copy of the human rDNA sequence con- 
taining different truncations in ITS2 and 28S 
was cloned into a plasmid, and unique probe 
sequences were introduced to monitor the 
production of recombinant ribosomes. The 
engineered rDNA plasmids were transfected 
into human HEK-293F cells, which were then 
used for isolation of recombinant ribosomes 
from different cellular compartments and 
further analysis by Northern blot. 


Cryo-EM data collection and processing 


Cryo-EM grids were prepared using the pu- 
rified pre-60S particles. Cryo-EM images were 
collected on a Titan Krios electron microscope. 
Cryo-EM data processing and model building 
were performed using standard software and 
procedures. 
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An improved bound on the electron’s electric 
dipole moment 


Tanya S. Roussy*{, Luke Caldwell’?+, Trevor Wright, William B. Cairncross’+, 
Yuval Shagam?§, Kia Boon Ng”, Noah Schlossberger’, Sun Yoo! Park’, Anzhou Wang", 
Jun Ye", Eric A. Cornell’2* 


The imbalance of matter and antimatter in our Universe provides compelling motivation to search 
for undiscovered particles that violate charge-parity symmetry. Interactions with vacuum 
fluctuations of the fields associated with these new particles will induce an electric dipole moment of 
the electron (eEDM). We present the most precise measurement yet of the eEDM using electrons 
confined inside molecular ions, subjected to a huge intramolecular electric field, and evolving 
coherently for up to 3 seconds. Our result is consistent with zero and improves on the previous best 
upper bound by a factor of ~2.4. Our results provide constraints on broad classes of new physics 
above 10” electron volts, beyond the direct reach of the current particle colliders or those likely 


to be available in the coming decades. 


lectric dipole moments of fundamental 
particles, such as the electron, are sig- 
natures of time-reversal symmetry viola- 
tion, equivalent to violation of combined 
charge and parity (CP) symmetry (J). CP 
symmetry is broken in the standard model but 
only in the quark sector (2), so the coupling to 
leptons is weak and the predicted electron’s 
electric dipole moment (eEDM) several orders 
of magnitude below current experimental sen- 
sitivity (3, 4). Explaining the imbalance of mat- 
ter and antimatter in the Universe requires 
additional CP violation, beyond that present 
in the Standard Model (5-7). Many proposed 


extensions predict new particles at energies 
higher than any so far discovered, with CP- 
violating interactions. These new particles 
can induce a much larger eEDM, often within 
reach of near-term experiments (8-10). A non- 
zero Measurement at current experimental 
sensitivities would unambiguously signal new 
physics, whereas a more precise measurement 
consistent with zero imposes challenging 
constraints on possible explanations of the 
matter-antimatter imbalance. Our measure- 
ment uses quantum projection-noise-limited 
spectroscopy on samples of hundreds of mo- 
lecular ions with interrogation times of up to 
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3 s. Our result, de = (—1.3 + 2.Ogtat + 0.6sy 
10-*° e cm, is consistent with zero and 
an upper bound of |d-| < 4.1 x 107-%° e cmat 
90% confidence. 

An eEDM d, = d,s—with § a unit vector 
along the spin of the electron—subject to an 
electric field, €, has an energy of —d, - €. The 
essence of an eEDM search is to measure the 
energy shift when § is aligned with € com- 
pared with when it is antialigned. The size 
of the observable shift scales with the size 
of €, and thus many existing (17-13) and pro- 
posed (J4-17) eEDM experiments use elec- 
trons embedded inside polar molecules, where 
intramolecular electric fields can be ~10° 
times larger than what can be directly ap- 
plied in the lab. These internal electric fields 
can be aligned in the lab frame by orienting 
the molecules with modest external electric 
fields. 

Our measurement uses HfF* molecular ions. 
In an applied electric field of ~58 V cm™!, the 
3A, (v= 0, J = 1) “science” state of the mo- 
lecule is split into a series of doublets (Fig. 1A). 
In two of these doublets, highlighted in col- 
or, the molecule is oriented (J8); the upper 
doublet (orange) has the intramolecular axis 
parallel to the applied field, whereas the lower ‘ 
doublet (blue) is antiparallel. This intramolec- 
ular axis defines the direction of an effective 
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lon trap 


Molecular beam 


Fig. 1. Experiment outline. (A) Level structure of the eEDM-sensitive °A, (v = 0,J = 1) state. The horizontal axis indicates mp, the projection of the total angular 
momentum onto the externally applied electric field. The vertical axis indicates the energy of the states. The direction of the electron spin and effective electric field, 
Ef, |S indicated for each of the states used in the experiment. (B) Schematic of ion trap, composed of eight radial electrodes and a pair of endcap electrodes. (Inset) 
Fields applied during experimental sequence: the rotating electric bias field, Erot. and the quadrupole magnetic field, B’. 
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electric field, E ef = 23 GV cm’ (19-22), acting 
on the spin of one of the valence electrons. In 
the presence of a small magnetic field, the two 
states in a doublet correspond to the spin of 
this valence electron being aligned or anti- 
aligned with Egg. We prepare a coherent super- 
position of the two spin states and measure 
the energy difference using Ramsey spec- 
troscopy. The eEDM will give a contribution 
to this energy, +2d,€ ef, with opposite sign in 
the two doublets. We perform the measure- 
ment simultaneously on spatially overlapping 
clouds of ions prepared in each of the dou- 
blets. The difference between the measured 
energies is our science signal. 


Experimental overview 


Our experimental apparatus is shown sche- 
matically in Fig. 1B. An overview of the ex- 
perimental sequence is given here; more details, 
including an account of improvements made 
since our earlier result (12), are presented in 
(23, 24). The sequence begins with produc- 
tion and radiofrequency trapping of roughly 
20,000 HfF* ions. To orient the molecules 
while maintaining confinement, we rotate 
the orienting field, €,,;, at angular frequency 
rot = 2m x 375 kHz and perform our spec- 
troscopy in this rotating frame. We also apply 
a quadrupole magnetic field gradient, B®, to 
create a time-averaged effective bias magnetic 
field, Brot (24). 


AONB ae Mh omen 
Nini S Nhat 
0 2 4 
B 


0 300 


We prepare an incoherent mixture of one 
of the spin states from each doublet, either 
\t") and |t") or |{") and |\') (Fig. 1A), and 
then apply a 5 pulse to create a coherent 
superposition of the two states in each dou- 
blet. We allow the superpositions to evolve 
for a variable amount of time, then apply a 
second 3 pulse to map the accumulated rela- 
tive phase between the states in a doublet 
onto a population difference between those 
states. We clean out the population in one of 


the spin states in each doublet, either Nis » or 
|\°/"), then count the number of ions in the 
remaining stretched states by state-selectively 
photodissociating the molecules and detect- 
ing the resultant Hf* ions (25). We use the op- 
posing orientations of the two doublets in 
the trap to send the Hf" ions originating from 
molecules in each doublet to opposite sides of 
our imaging microchannel plate (MCP) and 
phosphor screen assembly (26, 27). We then 
repeat the procedure with the opposite initial 


————————————_ _ —_ ESSE 
Table 1. Summary of systematic shifts and their uncertainties. Data are as presented in (23). All 


values are in microhertz. 


Effect 


Magnetic 


Correction Uncertainty 


6 770 772 774 


600 900 
Ramsey time (ms) 


1538 1540 1542 


1200 1500 


Fig. 2. Example Ramsey data. (A) Detection of Hf ions; ions are assigned to the upper or lower doublet based on their position (upper doublet shown in orange, lower 
doublet in blue). Counts from a thin central swatch where the assignment is ambiguous (shown in gray) are removed. Images shown are averaged over 60 shots of 
the experiment. (B) Asymmetries for the upper and lower doublet. (C) Fitted sum and difference asymmetries, As and Ap, used to extract mean and difference frequencies, 
fm and fy. Middle-time data, where the two doublets are out of phase, were collected in this example dataset for illustrative purposes only. Such data contribute very 
little to the frequency determination and were not collected during the final precision dataset. 
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spin state. Example data are shown in Fig. 2A. 
We count the ions on each side of the screen, 
in each configuration—typically ~120 ions from 
each doublet after a 3-s hold time—and from 
these two measurements construct the two 
asymmetries, A, and A}, where 


1 1 

Avy = Mi — Nika (1) 
u/l u/1 u/1 
Nin at Nanti 


Here, Np "and nw are the number of ions 
counted, u/1 indicates the upper or lower dou- 
blet, and the subscripts indicate whether we 
read out the same state that we prepare (In) or 
the opposite (Anti). We repeat our measurement 
at different free-evolution times, generating a 
pair of Ramsey fringes as shown in Fig. 2B. The 
frequencies of these two fringes are proportional 
to the energy splitting in the two doublets. The 
primary contribution to this energy splitting is 
the Zeeman splitting, 3grugBrot, where gr = 
—0.0031(1) is the g factor of the science state 
(28). For our typical experimental parameters, 
this produces fringe frequencies of ~100Hz. 
Other effects, including the eEDM, make small 
modifications to this frequency. 

Instability of the intensity of the pulsed 
lasers used for creation and photodissociation 
of the ions (24) creates considerable noise in 
the number of ions measured in each shot of 
the experiment, typically approximately three 
times as large as the quantum projection- 


Fig. 3. Systematic shifts in O 
the measurement. (A) Shift in 
f°8 caused by nonreversing 
magnetic quadrupole field, 
which can be corrected by 
using the shift in the f® 
channel, which is ~460 times 
as large as the f>8shift. (B) 
Shift in the f° channel caused 
by deliberately applied second- 
harmonic electric field €>, with 
transverse magnetic field, B. 
Data show variation in shift as -15 
angle @5,, between E>, and x axis 
is varied. Eo, is ~250 times 

as large as that present in the 
experiment; B is ~ 14 mG. (Inset) 
Lissajous figure traced out by 
total electric field for greatly 
exaggerated ratio of Ea, /Erot: 
blue and orange show one-half 
cycle each. The field points in 
the —x direction for more time 
than in the -+-x direction. 
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noise limit on the side of the fringe at 3 s. 
However, these sources of noise, and many 
others, are common mode between the two 
doublets—the exact same laser pulses address 
both clouds of ions—and so the noise in A, and 
A; is highly correlated. To take advantage of 
this, we form the sum and difference asymme- 
tries As = Ay + A; and Ap = Ay — A; (Fig. 
2C). If we take data when the two doublets are 
close to being in phase, the noise in Ap is 
drastically reduced (27). The two doublets 
oscillate at slightly different frequencies, f, 
and fi, owing to a~1/230 fractional difference 
in their magnetic moments, and so during 
the eEDM dataset we deliberately take our 
data at a beat. We take two sets of points: the 
early-time data, when the two doublets are in 
phase, and the late-time data ~230 oscillations 
later, when they come back into phase again. 
We can control the time of the second beat 
by varying the strength of the magnetic bias 
field, B,ot. We fit to Ag and Ap to extract the 
mean of the two fringe frequencies, fm = 
+ (fa +/fi), and their difference, fa = 4 (fa —Ji)- 

We collect Ramsey fringes in 2* = 8 exper- 
imental states, corresponding to each possi- 
ble combination of three binary experimental 
switches, {B,R,1} = +1. B is the direction of 
the magnetic bias field relative to Exot, R the 
rotation direction of E;ot, and/ the direction of 
Exot Yelative to the imaging MCP at the instant 
of photodissociation, determining which side 


31/2 


of the phosphor screen each of the doublets is 
imaged onto. A set of Ramsey fringes in each of 
the 8 switch states forms a block. To minimize 
the effects of experimental drifts, within a block 
we interleave data collection for the switch 
states; the first Ramsey time is recorded for all 
switch states before moving onto the second 
Ramsey time for each switch state, and so on. 
We take the 16 fitted frequencies from each 
block and form 16 linear combinations to give 
the components of the measured frequencies, 
which are even or odd under each of the experi- 
mental switches. Following (29), we label the 
components with superscripts that denote 
the switches under which the quantity is odd. 
For example, our science signal is {?8, the 
component of the difference frequency that is 
odd under B but even under R and I (30). The 
other channels allow us to diagnose system- 
atics and monitor experimental performance. 

Over the course of the dataset, we varied a | 
number of other experimental parameters on 
timescales slower than a block. These include 
the state we read out at the end of the Ramsey 
sequence, denoted P = +1 and alternated each 
block; the order in which the switch states 
are recorded at each Ramsey time, alternated 
every other block; three different magnitudes 
of the magnetic bias field, corresponding to 
mean fringe frequencies of f°~77, 105, and 
151 Hz; and reversal of the waveplates that 
set the lab-frame handedness of the light used 
for state preparation and readout. During data 
collection and analysis, we “blinded” our mea- 
surement of f?? by adding an unknown offset 
to this channel. The offset was not removed 
until our systematics search and analysis (23) 
were complete. 


Accuracy evaluation 


To evaluate the accuracy of our measurement, 
we searched extensively for systematic shifts 
before data collection; a summary is given in 
Table 1. In general, we tuned a variety of ex- 
perimental parameters over ranges that were 
large compared with those present during 
data collection, exaggerating any accompany- 
ing systematic effects, and observed the re- 
sponse in our data channels. The only shift we 
could observe directly in the eEDM channel 
stems from a nonreversing quadrupole mag- 
netic field and the difference in magnetic 
moments between the two doublets, caused 
primarily by the applied electric field mixing 
the states of the two doublets with higher 
rotational levels of the molecule. The f? chan- 
nel provides a direct measurement of the non- 
reversing magnetic field and allows us to apply 
a correction to our science channel, §f23 = 
f? Bee, where dg; is half the difference between 
the g factors for the upper and lower doublets 
(Fig. 3A). Before applying any corrections to 
the science channel, we suppress this system- 
atic by actively shimming the currents through 
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Fig. 4. Summary of our 
dataset. Cuts have been 
applied and the uncer- 
tainty on each block 
scaled by \/x2 to account 
for overscatter. (A) Histo- 
gram of data. Error bars 
show standard deviation of 
bin counts expected from 
Poisson distribution. The 
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distribution. (B) Normal 886 N < 22k Kos 
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the overall average value 
of f°8. Here, N is the 


from mean 


average number of trapped HfF* ions per experimental trial during a block. Other panels in (C) are described 
in the final paragraph of the Experimental Overview section. 


the coils that apply the magnetic bias field 
to minimize f*% This shimming was so ef- 
fective that the mean correction that we ap- 
plied was well below our statistical sensitivity, 
F032 = 90 nHz. 

The shimming and correction procedure 
leaves us susceptible to other possible effects 
that cause shifts in f? and f?8 with a ratio 
different from Bee. An important example of 
such a shift is the combination of a transverse 
magnetic field with an electric field oscillating 
at 2,9, which was present in our experiment 
owing to harmonic distortion in the amplifiers 
driving the trap electrodes. The harmonic dis- 
tortion causes the electric field, and thus the 
magnetic moment of the molecule, to spend 
more time pointing in one spatial direction 
than the other, giving a nonzero time-averaged 
Zeeman interaction with a background mag- 
netic field. This causes shifts in f? but no cor- 
responding shifts in f?8, where the effect is 
canceled by the coincident change in the size 
of the differential magnetic moment owing 
to the distortion. Figure 3B shows the shift 
in f? when we deliberately apply a second- 
harmonic electric field and vary its angle. We 
shim out the second harmonic on each elec- 
trode by feeding forward a second-harmonic 
signal with the opposite phase, suppressing 
the amplitude by ~80 dB. We used magnetic 
shim coils to null the ambient magnetic field 
at the trap center to <10 mG. The measured 
sizes of the residual effects were used to com- 
pute the maximum size of the systematic 
during our dataset. 
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A full account of all systematic shifts con- 
sidered is presented in (23). 


Measuring the eEDM 


We collected 1370 blocks over about 2 months, 
corresponding to ~620 hours of data and ~10° 
ion detection events. Each block results in one 
value of f?? and thus a single measurement of 
d,. The uncertainty on f?8 for each block is 
calculated with only the standard errors on the 
asymmetries for that block. We applied cuts to 
the blinded data on the basis of non-eEDM 
channels that indicated signal quality. Blocks 
with late-time contrast <0.2 were cut because 
of a low signal-to-noise ratio, as were blocks 
containing fitted fringe frequencies that were 
>3.50 different from the mean fringe frequen- 
cy for that switch state. After applying cuts, we 
were left with 1329 blocks with y? = 1.07(4) 
for f?®. Figure 4, A and B, show the distrib- 
ution of measured f?? values over the 1329 
blocks after relaxing the uncertainty for each 
of the blocks by a factor of V2 = 1.035. The 
data are consistent with a normal distribution. 
Our final statistical uncertainty of 22.8 uHz 
is obtained with these relaxed uncertainties. 
Based on the number of ions detected in 
each shot, this uncertainty is ~30% above the 
quantum projection-noise limit. More details 
on how we determine uncertainties are given 
in (24). 

Figure 4C shows how the measured value 
of f?® depends on experimental parameters 
varied during the dataset; we find no con- 
cerning dependencies. 


We removed our blind on 1 November 2022, 
and obtained a final value for the eEDM- 
sensitive frequency channel 


FP? = 14.6422. Bstar + 6.9syst WHz (2) 


Dividing by —2&er¢ S88") ~ 1.11 x 107! wHz 
ecm’ (21, 3D), we obtain a value for the eEDM 


de = (-1.3 + 2.Ostat + 0-6gst) X 107° e cm (3) 


which is consistent with zero within one stan- 
dard error. The combined statistical and sys- 
tematic uncertainty, og, = 2.1 x 10°-7°e cm, 
improves on our previous work (72) by a factor 
of ~37, and on the previous state-of-the-art 
from the ACME collaboration (73) by a factor 
of ~2. This result and that of the ACME 
collaboration—two measurements using very 
different experimental platforms with con- 
trasting sources of systematic shifts—are con- 
sistent at slightly above one standard error. 


Discussion 


We use our result to obtain an upper bound 
using a folded Gaussian distribution 


|d.| < 4.1 x 10-° e em (90% confidence) 
(4) 


This limit constrains extensions to the 
Standard Model that predict new sources of 
CP-symmetry violation to explain the matter- 
antimatter asymmetry of the Universe (32). 
Many extensions, including supersymmetry, 
the two-Higgs model, and left-right symmetric 
models, generate an eEDM at the one-loop 
level (33), with magnitude (9) 


edpa. g” 7 


sind me (5) 
2 on |? MP 


de ~ 


Here, M is the characteristic mass of new par- 
ticles with effective coupling strength, g, to the 
electron; cp is the phase that describes how 
strongly the interaction violates CP symmetry; 
Me and e are the mass and charge of the elec- 
tron respectively; and o is the fine-structure 
constant. Because d,°<M~?, and because our 
limit in Eq. 4 is a factor of ~2.4 smaller than 
the limit reported in (13), we are sensitive to new 
particles with mass that is /2.4 = 1.5 times 
as large. 

To estimate the mass reach of our exper- 
iment, we need to make assumptions for the 
size of g? and sindcp. For the strong force, 
quantum electrodynamics, and the weak force, 
gx1l /137, and 10~°, respectively. For exten- 
sions to the Standard Model seeking to explain 
the matter-antimatter asymmetry, the naive ex- 
pectation is that sindgp ~ 1. With this assump- 
tion, we can interpret our new limit on d, as 
M2 (g/ 02) 40 TeV. For new particles with g? 
of order o ~ 1/137, this bound is an order of 
magnitude greater than the largest-mass 
particles that can be directly detected at the 
Large Hadron Collider (34). 
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So far, we have assumed that CP violation 
arises purely from @,. Diatomic molecules are 
also sensitive to pseudoscalar-scalar electron- 
nucleon coupling, Cs (35, 36), and we can 
interpret our measurement as a linear combi- 
nation, hf?® x sgn(gr) = —2Ee#de + 2WsCs, 
where us = —51kHz (3D) is a molecule-specific 
structure constant. Assuming that d, is zero, we 
can instead attribute our measurement to Cs, 


and we find 
Cs = (1.4 + 2.2stat + O-Zeyst) X 107° (6) 


Determining rigorous limits on d, and Cs 
requires combining the results of two or more 
measurements using molecules with different 
ratios of Eg to Ws. Figure S1 shows a combined 
fit to the results of this work and (13), giving 
upper bounds of |d,| < 2.1 x 10-% ecm and 
|Cs| < 1.9 x 10-® with 90% confidence. Our 
measurement improves these bounds by fac- 
tors of 16 and 12, respectively (37). 
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A lithium superionic conductor for millimeter-thick 


battery electrode 


Yuxiang Li'}, Subin Song”, Hanseul Kim’, Kuniharu Nomoto’, Hanvin Kim‘, Xueying Sun7§, 
Satoshi Hori’, Kota Suzuki", Naoki Matsui, Masaaki Hirayama’, Teruyasu Mizoguchi®, 
Takashi Saito*®*, Takashi Kamiyama*°4, Ryoji Kanno‘ 


No design rules have yet been established for producing solid electrolytes with a lithium-ion conductivity high 
enough to replace liquid electrolytes and expand the performance and battery configuration limits of current 
lithium ion batteries. Taking advantage of the properties of high-entropy materials, we have designed a 
highly ion-conductive solid electrolyte by increasing the compositional complexity of a known lithium superionic 
conductor to eliminate ion migration barriers while maintaining the structural framework for superionic 
conduction. The synthesized phase with a compositional complexity showed an improved ion conductivity. We 
showed that the highly conductive solid electrolyte enables charge and discharge of a thick lithium-ion battery 
cathode at room temperature and thus has potential to change conventional battery configurations. 


ll-solid-state lithium batteries (ASSLBs) 

have attracted research interest because 

the solid battery configuration—which 

uses lithium superionic conductors as 

the electrolyte instead of liquids in the 
current lithium batteries—has potential for 
improved safety and enhanced energy-power 
characteristics (-4). After decades of research 
all-solid-state cells are capable of discharging 
at a high current density of >10 mA cm™ (5). AL 
though the cathode used in that study was only 
tens of micrometers thick, such fast discharge 
properties imply high energy and power densi- 
ties for future ASSLBs, especially at elevated tem- 
peratures such as 100°C in which the operation of 
Li-ion batteries using organic liquid electrolytes 
(e.g., those containing LiPF, and ethylene car- 
bonate) are currently restricted due to safety 
concerns (4, 6, 7). These results are attributed to 
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solid electrolytes with Li* conductivities higher 
than 10 mS cm” at room temperature, which is 
comparable to that of traditional liquid elec- 
trolytes (8-10). These solid electrolytes include 
LijpGeP.S,2 (LGPS) (8) and Li-argyrodites rep- 
resented by LigPS,;Cl (17). A conductivity of 25 mS 
cm” was reported for Lig 54Siz74P1.44S17Clos 
(LSiPSCl) (5) having the same kind of crystal 
structure as that of LGPS (LGPS-type struc- 
ture), which is characterized by a bcc-like anion 
sublattice, a framework composed of tetrahedral 
[P/Tt]S, units (Tt = Si, Ge, Sn), and widely dis- 
tributed Li sites (12, 73). 

Although the conductivities of these solid 
electrolytes are comparable to those of liquid 
electrolytes their stiff nature can be a dis- 
advantage. As a result of the difficulty in wetting 
the surface of the cathode active material as a 
catholyte, solid electrolytes must be incorporated 
at the microstructural level into the electrode, 
resulting in void space inside it. This hinders 
the homogeneous supply of Li* to the active 
material, leading to a capacity loss (/4) that 
makes the performance of ASSLBs inferior 
to that of Li-ion batteries despite the superior 
conductivity of the solid electrolytes relative to 
liquid ones. This problem presumably becomes 


Fig. 1. Relationship between the 
crystal structure indicator 

(t, volume ratio of anions to 
that of cations) and composi- 
tional complexity metric (S,nix) 
for reported LGPS-type (open 
circles) and argyrodite-type (open 
squares) electrolytes. The newly 
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explored phases in this work are indi- 
cated by filled triangles. Gray shading 
indicates the Veum-anion/ Vsum-cation 
range for LGPS-type phases and 

the red filled triangle corresponds to 
the phase with the highest conductivity. 
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PS 
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more pronounced in millimeter-thick elec nee 
des, where ohmic drop increases when! .2— 
conductivity of solid electrolyte is low (15). In 
addition, the capacity decrease of the solid- 
state battery would be even more severe at low 
temperatures (below 0°C) in which the con- 
ductivity of the electrolyte drops sharply. 

The history of the development of inorganic 
ionic conductors suggests that conductivity 
enhancement was achieved using the multiple 
substitution strategy. Ag* (16) and Cu* (7) 
conductors displayed high conductivities of 
>200 mS cm™ at room temperature (17-19) 
through compositional changes from AgI to 
RbAg,I, or Cul to Rb4CuygI7Ch3. This approach 
corresponds to the “mixed (poly)-anion” (20, 27) 
or “high-entropy” (22, 23) design reported 
previously for the development of some Li* 
conductors, in which the aim was to flatten 
the potential barrier for Li* migration (27). The 
multisubstitution approach is attractive for 
realizing phases with higher ion conductivity, 
provided that the materials possess a specific 
crystal structure having superionic conduction 
pathways, such as LGPS-type crystal struc- 
tures. However, there has been no practical 
principle to realize compositional complexity 
in the anions and cations or to avoid collapse ‘ 
of the target crystal structure. Consequently, 
this multi-element substitution strategy has 
not been used in designing superionic crystals 
with high conductivity nor have researchers ‘ 
explored the full potential of the crystal struc- 
ture that enables superionic conduction. 

We developed a solid with high Li* conduc- 
tivity using a high-entropy material design 
while preserving the target crystal structure. 
The simple design rule, as shown in Fig. 1, is 
derived from the chemical composition trend 
of well-known argyrodite-type and LGPS-type 
superionic crystals (8, 17). The diagram shows 
the plot for two composition parameters: the « 
crystal structure indicator (¢) and the compo- ‘ 
sitional complexity metric (S,,jx). The former is 
the ratio of sum of volumes for constituent 
anions (Vsum-anion) to that of cations including . 
lithium (Veum-cation) and can be calculated from 


A This work 
Oo LGPS 
a Argyrodite 


O 
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Crystal structure indicator t (= Vsum-anion/ Vsum-cation) 
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Fig. 2. Structure analysis of A 
LSiPSBrO. (A) Neutron Rietveld 
refinement pattern at -269°C. The 
observed and calculated patterns 
and the residual differences are 
shown in red, cyan, and blue, 
respectively. Green bars indicate 
the positions of Bragg reflections. 
Lattice-plane spacing d ranges 
excluded from the analysis are not 
shown. (B) Schematic of the 
LSiPSBrO crystal structure 
determined by Rietveld analysis. 
Occupancy of the S, Br, and 

O atoms at the S sites are 
indicated by yellow, green, and 

red colors, respectively. 


the ionic radius (24). The latter originates from 
the configuration entropy of mixing in high- 
entropy alloys and can be calculated from the 
occupancy disorder for the anion and cation 
species (except lithium). The calculation details 
of t and S,,ix are presented in figs. S1 to S4. 
Table SI lists the ¢ and S,,;x values for the newly 
synthesized materials and previously reported 
argyrodite-type and LGPS-type phases. 

The gray shaded region in Fig. 1 indicates 
that preservation of the LGPS-type structure 
requires stricter control of the ¢ parameter com- 
pared with that for the argyrodite-type struc- 
ture. Because pure phase construction is the 
prerequisite for realizing the full potential of 
compositionally complex materials, we chose 
a high S,ix value and a feasible ¢ value as the 
design guide for effectively exploring the conduc- 
tive phases. Based on this rule, in this study we 
modified the LGPS-type LSiPSCI to design a series 
of compositions; Lig 54[Si;_sMsh.74Pi445n1Bro3006 
(LSiM;PSBrO, M = Ge, Sn; 5 and 1-5 are the 
molar ratios of M** and Si** in the target com- 
position, respectively, with 0 < 5 < 1). The 
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designed compositions have high values of 
Smix (solid triangles in Fig. 1) not achieved 
by previously reported LGPS- or argyrodite- 
type material groups (listed in table $1) whereas 
their ¢ parameters are within the range of 
previously reported LGPS-type single phases. 
After systematically synthesizing and charac- 
terizing these selected compositions, the high- 
est observed o value was 32 mS cm ‘at 25°C. 


Synthesis and structure analysis 


First, we carried out the synthesis and struc- 
tural analysis for the simplest of the designed 
LSiM;PSBrO series with 3 = 0 (i.e., LSiPSBrO). 
Figure 2A shows the neutron diffraction pattern 
of LSiPSBrO at -269°C, which was used for 
structure determination by Rietveld refine- 
ment. The pattern was indexed with the same 
space group as that of LGPS (P4./nme) and 
unchanged at 272°C (fig. S5 and table S2), 
indicating that the phase maintains an LGPS- 
type crystal structure up to this high tempera- 
ture. In the Rietveld refinement the initial 
structure model was taken from the literature 


Si1/P1(S/Br/O)a 


Si2/P2(S/Br/O). 


w) Occupancy(S1) = 0.9441(19) 


VY Occupancy(S2) = 0.9165(18) 


peel tana paneer Inatanpenearnnh th Ameren pre 


woe ov 


Vv Occupancy(S3) = 0.914(2) 


(12). Figure 2B shows the crystal structure of 
LSiPSBrO at —269°C, based on the determined 
structural parameters listed in table S3. The Br 
and O atoms in LSiPSBrO prefer to averagely 
substitute the three sulfur (S) sites. The ob- 
tained increase in constituent anion species— 
which are highly disordered in LSiPSBrO— 
could potentially lead to energy landscape 
flattening and a lower activation energy barrier, 
thus favoring fast ionic migration (9, 25, 26). 
This expectation is supported by the energy 
barrier simulation using ab initio calculation 
(fig. S6), which indicated that although the sub- 
stitution degree for S atoms was low (<10 mol%), 
the influence on the potential energy land- 
scape for Li* migration could be substantial. 
Thus, the current high-entropy design may en- 
hance Li* conduction. 

Based on LSiPSBrO, structural analysis was 
further performed for the Ge- and Sn-substituted 
LSiM;PSBrO samples (0 < 5 < 1). As expected 
from the ¢ parameter, the designed composi- 
tions demonstrated reasonably pure LGPS-type 
phases, with the ratio of more than 85 wt.% as 
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Fig. 3. lonic conductivity. 
Arrhenius plots for the 

bulk conductivities of Ligs54 
[SioneGeo4hi74P1.44Su1Bro3006 
and LGPS. Inset: typical imped- 
ance plots measured at -25°C 
(right) and 25°C (left). 


log[a/S cm] 


determined from Rietveld refinement (table S4). 
Specifically, LGPS-type single phase was ob- 
tained in the range of 0 < 4 < 0.5 for the Ge- 
substituted samples and 0 < 5 < 0.4 for the 
Sn-substituted samples. The powder x-ray 
diffraction (XRD) patterns are shown in fig. $7. 
The lattice parameters were calculated from 
the synchrotron XRD data and are shown in 
fig. S8. The unit cell volume increased as the 
amount of Ge/Sn increased in the LGPS-type 
LSiM;PSBrO phase, confirming the substitu- 
tion of Si by either Ge or Sn. Thus, the solid- 
solution limits for these LGPS-type phases 
were determined. Differential thermal analy- 
sis and high-temperature XRD (fig. S9) indi- 
cated that the solid-solution series is thermally 
stable up to 400°C. 


lonic conductivity 


The series of monophasic LSiM;PSBrO samples 
showed conductivities higher than 10 mS cm™ at 
room temperature, even in the compressed 
powder state with grain boundary resistance 
(table S4 and fig. S10). The intracrystal conduc- 
tivity was further evaluated for the composition 
at M = Ge and 6 = 0.4 in LSiM;PSBrO (i.e., Lig 54. 
[Sio.6Geo.4h.74P1.4451Br0.300.6, abbreviated as 


bd 


Voltage / V vs. Li/In-Li 
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Areal capacity / mAh cm” 


Utilization efficiency % 
40 
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LSiGePSBrO hereafter), because the LSiGePS- 
BrO phase formed with the high phase purity 
and showed nearly maximum conductivity 
values. Figure 3 shows the Arrhenius plots in 
the temperature range of —50°C to 60°C for 
hot-pressed LSiGePSBrO (see figs. S11 to 13 for 
details of measurements). The bulk conduc- 
tivity at room temperature was determined to 
be 32 mS cm”, which is approximately three 
times higher than that of the original LGPS 
and is the highest conductivity among Li* con- 
ductors reported to date. It should be noted 
that at -10°C, LSiGePSBrO still exhibits a high 
conductivity of 9 mS cm, which is compara- 
ble to that of LGPS at room temperature. 
These extremely high conductivities can po- 
tentially facilitate the efficient use of active 
materials in all-solid-state cells particularly 
with thick cathode configurations. 


Application to thick battery electrode 


To demonstrate the potential of highly con- 
ductive LSiGePSBrO in ASSLBs, it was used 
in a heavily loaded cathode by mixing with 
LiNbO;-coated LiCoO, (LNO-LCO) as the active 
material. A solid-state cell was prepared by 
loading the cathode mixture at 245 mg cm™”, 
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which corresponds to an areal theoretical ca- 
pacity of 23.5 mAh cm™” and a thickness of 
800 um. Figure 4A shows the discharge curves 
at a current density of 0.587 mA cm 7 (0.025 C 
rate) in the temperature range of 25°C to —-10°C. 
At 25°C, the cell showed an areal discharge 
capacity of 22.7 mAh cm? with 97% utiliza- 
tion efficiency for the input LCO at the cutoff 
voltage of 4.25 V versus Li*/Li. This observed 
areal discharge capacity is compared with 
others for ASSLBs (Fig. 4B) (4, 14, 27-30). 
More than 92% areal discharge capacity was 
reattained after the cycle test over 100 days, 
indicating no significant decomposition of 
LSiGePSBrO in the cathode at 4.25 V versus 
Li*/Li (figs. $14 and S15). When the tem- 
perature decreased, the discharge capacity 
gradually decreased and the overpotential 
increased. The observed overpotential is at- 
tributed to the gradual decrease in the ionic 
conductivity of LSiGePSBrO at lower temper- 


atures. Nevertheless, the discharge capacity : 


at -10°C is still 75% of that at 25°C. Compared 
with recently reported heavily loaded all-solid- 
state cells (4, 27), the present cell with a higher 
cathode weight delivered better capacity reten- 
tion at low temperatures (Fig. 4C). Combined 
with lithium metal anode, the cathode com- 
posite with LSiGePSBrO discharged at a large 
current density of 144 mA cm” at 60°C (fig. S16). 
The charge-discharge operation was confirmed 
even for an electrode 1-mm thick (fig. S17). 

In conclusion, a series of solid electro- 
lytes with the composition of Lig 54[Siy_;M5] 
174P1.445111Br9.300.6 (M = Ge, Sn; 0 < 6 $1) was 
designed to achieve high configuration entropy 
while preserving their crystal structure with a 
superionic conduction pathway. The bulk con- 
ductivity of monophasic LSiGePSBrO (M = Ge, 
5 = 0.4) was found to be 32 mS cm™ at 25°C. 
Theoretical calculations and structural analy- 
ses suggested that even the small degree of 
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Fig. 4. Performance of all-solid-state cells. (A) Discharge curves for the thick electrode presented herein (~800 pm) under 0.587 mA cm (0.025 C) at different 
temperatures ranging from 25°C to -10°C. Inset: cross sectional scanning electron microscopy image of the cathode composite pellet with a capacity loading of 
23.5 mAh cm. (B) Comparison of areal capacities at 25°C for present and reported all-solid-state cells. The applied current density was between 0.2 and 

0.68 mA cm”. (C) Comparison of areal capacities (lower) and discharge capacity retention rate (upper) at 25°C and lower temperatures. 
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chemical substitution in LSiGePSBrO could 
flatten the energy barrier for ion migration, 
thereby rationalizing the observed conductivity 
enhancement in this phase. An all-solid-state 
cell with a heavily loaded cathode (thickness 
800 um) incorporating LSiGePSBrO as the 
catholyte demonstrated a discharge capacity of 
22.7 and 17.3 mAh cm at 25 and -10°C, corre- 
sponding to the utilization efficiency of 97 and 
73% for the active material, respectively. This 
study highlights the significance of a high con- 
ductivity in the charge-discharge performance 
of ASSLBs using a thick cathode configuration. 
The design rule presented herein may accel- 
erate the exploration of superionic conductors. 
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Cortical polarity ensures its own asymmetric inheritance 
in the stomatal lineage to pattern the leaf surface 


Andrew Muroyama’*, Yan Gong't, Kensington S. Hartman’, Dominique C. Bergmann™>* 


Asymmetric cell divisions specify differential cell fates across kingdoms. In metazoans, preferential 
inheritance of fate determinants into one daughter cell frequently depends on polarity-cytoskeleton 
interactions. Despite the prevalence of asymmetric divisions throughout plant development, 
evidence for analogous mechanisms that segregate fate determinants remains elusive. Here, we 
describe a mechanism in the Arabidopsis leaf epidermis that ensures unequal inheritance of a 
fate-enforcing polarity domain. By defining a cortical region depleted of stable microtubules, the 
polarity domain limits possible division orientations. Accordingly, uncoupling the polarity 

domain from microtubule organization during mitosis leads to aberrant division planes and 
accompanying cell identity defects. Our data highlight how a common biological module, coupling 
polarity to fate segregation through the cytoskeleton, can be reconfigured to accommodate 


unique features of plant development. 


symmetric divisions establish differen- 
tial cell identities by positioning daugh- 
ter cells relative to fate-enforcing extrinsic 
signals or by segregating intrinsic fate 
determinants (1, 2). The development 
of many plant organs depends on asymmetric 
divisions that place daughter cells in proximity 
to neighbor-derived signals such as mobile tran- 
scription factors (3, 4), hormones (5-7), or small 
peptides (8, 9). However, whether plants have 
mechanisms that ensure preferential inheri- 
tance of fate regulators remains unresolved. 

To investigate how plant cells couple asym- 
metric divisions to identity specification, we 
monitored the cell division and differentia- 
tion dynamics of stomatal precursors in the 
Arabidopsis leaf epidermis (Fig. 1A). In this 
lineage, flexible asymmetric divisions in mor- 
phologically heterogeneous early lineage cells 
create and pattern stomata (cellular valves that 
mediate plant-atmosphere gas exchange) by 
priming divergent developmental trajectories 
in the two daughters. The smaller meristemoid 
will eventually give rise to the paired guard 
cells that comprise a stoma, and the larger 
stomatal lineage ground cell (SLGC) will expand 
to become a pavement cell. 

All asymmetric cell divisions within the 
lineage are preceded by the formation of a 
plasma membrane-associated polarity com- 
plex defined by BREAKING OF ASYMMETRY 
IN THE STOMATAL LINEAGE (BASL) (0) 
and BREVIS RADIX family (BRXf) (77) pro- 
teins (Fig. 1, A and B). Before mitosis, the 
BASL/BRXf crescent recruits another polarly 
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localized protein, POLAR (72), which in turn 
enriches GLYCOGEN SYNTHASE KINASE3 
(GSK3)-like kinases at the cortex to promote 
asymmetric divisions (73), and then directs 
nuclear migration to bias the division site 
(74). After it is inherited by the SLGC through 
asymmetric division, cortical BASL/BRXf en- 
hances MITOGEN ACTIVATED PROTEIN 
KINASE (MAPK) signaling to suppress meri- 
stemoid identity (13, 15). Despite its central 
role in coordinating asymmetric division entry 
and daughter cell identity after division (16), 
how BASL/BRXf asymmetric inheritance is 
regulated is unknown (Fig. 1B). 


Stomatal lineage cells break common division 
orientation rules 


We performed time-lapse imaging of de- 
veloping cotyledons harboring markers for 
nuclei [R2D2 (17)], the plasma membrane 
(MLIp::mCherry-RCI2A), and the polarity cres- 
cent (BRXL2p::BRXL2-YFP). In agreement with 
previous observations, all asymmetric divi- 
sions resulted in singular inheritance of the 
BRXL2 crescent. Close analysis of these cells, 
however, revealed two asymmetric division 
subclasses that were defined by their divi- 
sion planes. The majority (73% of divisions) 
divided along the calculated shortest distance 
that intersected opposing cell walls at the site 
of the nucleus [i.e., a small angle between the 
calculated shortest wall and the division plane 
(A0°); see the materials and methods] (Fig. 1, C 
to F), thus following the expectations set by the 
observed division planes in many plant cell types 
(8, 19). For this class of asymmetric divisions, 
polarity-directed nuclear migration (14) coupled 
with minimization of the division plane ac- 
curately predicted the final division site. 
Division planes in the second class of asym- 
metric divisions (27% of divisions) deviated 
significantly from the calculated shortest wall 
(Fig. 1, D, E, and G), suggesting that additional 


9 


inputs control the orientation of these € oot 
lineage divisions. Stomatal lineage divis\-—2 
are not unique in breaking the shortest wall 
rule, but other occurrences of them during 
Arabidopsis development are related to broad, 
extrinsic influences such as tissue mechanics 
or hormone signaling (20, 27). Discrete, cell- 
autonomous mechanisms that tune division 
orientation have not been described. The mor- 
phological heterogeneity of stomatal precursors 
was well represented within both asymmetric 
division subclasses (Fig. 1, F and G), suggesting 
that unique geometric features do not define 
asymmetric division subtypes. Instead, asym- 
metrically dividing cells specifically bypassed 
the calculated shortest division plane (large 
A0°) when that wall was predicted to intersect 
the plasma membrane within the cortical po- 
larized site (Fig. 1G). If only the nonpolarized 
membrane was considered permissive for 
division plane placement, then these asym- | 
metric divisions continued to follow the shortest 
wall rule (fig. S1). This correlation suggested 
that polarized BASL/BRXf may be a cell- 
intrinsic cue capable of constraining poten- 
tial orientations to control its own asymmetric 
inheritance. 

Next, we tested whether cell polarity is ‘ 
necessary to stratify the two asymmetric divi- 
sion classes by tracking progenitor divisions in 
basl mutants (basl 35Sp::PIP2A-RFP MLIp::H2B- 
YFP). Loss of cellular polarity in basil collapsed * 
the two asymmetric division classes into one 
(Fig. 1, H and I) that varied significantly from the 
total wild-type asymmetric divisions (Kolmogorov- 
Smirnov test, P = 0.0005). basi divisions did not 
differ significantly from wild-type asymmetric 
divisions without polarity conflict (P = 0.1568), 
indicating that bas! divisions follow the short- 
est wall rule. Therefore, the polarized BASL/ 
BRXf domain is required to override default 
division patterns during formative asymmetric - 
divisions. c 


BASL influences preprophase band position 


To determine the basis of this control, we . 
examined the cortical microtubule structures 
that play essential roles during division orien- 
tation in plant cells (22). TANIp::CFP-TAN1 
foci, which mark the cortical division zone 
(23), never appear within the BASL/BRXf 
domain, suggesting that BASL/BRXf operate 
at an early step during division orientation 
(fig. S2). We found that the preprophase band 
of microtubules, which is the first marker of 
the eventual cortical division site, never formed 
within the polarity domain (Fig. 1, J to M). An 
analysis of the A6° between the preprophase 
band and the calculated shortest wall showed 
a similar bifurcation of asymmetric divisions 
into two classes. The majority (66%) had pre- 
prophase bands that closely aligned to the 
predicted shortest wall, whereas preprophase 
bands in the second class (34%) deviated 


1 of 6 


RESEARCH | RESEARCH ARTICLE 


A BRXL2p::BRXL2-YFP 
ML1p::mCherry- Boe 


«1 Predicted shortest wall 
== Division plane 


(op 


A@°=/6 -0 


Cc BRXL2p::BRXL2-YFP R2D2 ML1p::mCherry-RCI2A 


F No polarity conflict 


oe A@° 1.1° 5.8° 


Bice iS J | 


Bpivision ~ Predicted | 


Fraction of ACDs 


a 
oO a 
CC D Wild-type ACDs < 
oO 08 n=95 2 7 
| 8 
: : Q 60° 3 G iis conflict 
Neat . A@° 
B Polarization <x | 1 2 i C3 > 
BASL/BRXf 2. 30° ge we 
ACD 5 4 | 
e- O- & Bo I _ 
Eiesg L se . AB° 70.4° 595° 60.4° 
' ‘ Polarity functions eae 
Post-ACD (SLGC) H basil *** ey 
; ; D 3 bas! ML1p::H2B-YFP 35Sp::PIP2A-RFP 
Active y Promote Sans of a oF aw 
meristemol ’ 
See ie identity < os AG? I > ge 
> P (eo) r 
=@ nave > |SPCH iad as 
position , 3 
© 0.2 
£ = 
- A@° 4.6° 1.9° 4.8° 
J BRXL2p::BRXL2-YFP L M 
TMMp::mCherry-TUA5 
Medial section Max. projection 
ee 7 . 
s x2 8 - 
F cr - oe 
; a : a 
E ge £ E 
Zz S 2 2 


-15 


-10 
Distance (um) 


5 0 5 
Distance (um) 


v 


Fig. 1. The Arabidopsis stomatal lineage polarity domain overrides default 
division rules during asymmetric cell division. (A) Left: image of the abaxial 
epidermis from a 3 dpg BRXL2p::BRXL2-YFP (magenta) MLl1p::mCherry-RCI2ZA 
(black) cotyledon. Cells have been pseudocolored according to the color scheme 
below. Scale bar, 25 um. Right: selected polarized cells highlighting the 
morphological diversity of cells in this lineage. Scale bar, 5 um. (B) Stomata 
(purple) are created through coupled divisions (asymmetric, ACD, and 
symmetric, SCD) and fate transitions. Top: illustration of the major fate 
transitions within the Arabidopsis stomatal lineage. Meristemoids are shown in 
green and the guard mother cell in blue. Bottom: illustration of the known 
functions for the BASL/BRXf domain. Before division in SPEECHLESS-positive 
(SPCH*) cells, cortical BASL/BRXf promotes ACD and instructs nuclear 
migration. After division in SLGCs, inherited BASL/BRXf/YODA initiates a MAPK 
cascade, leading to suppression of SPCH. Mechanisms controlling singular 
polarity domain inheritance are needed. (C) The assay used to quantify Ae*. 
This assay was used for the data shown in (D) to (I). For additional details, 

see the materials and methods. (D) Quantification of A@° during wild-type ACDs. 

n = 95 cells. (E) Percentage of total wild-type ACDs in which polarity did (27%) or 
did not (73%) conflict with the predicted shortest wall. (F and G) Left: distribution 


of Aé in ACDs in which the predicted shortest wall did (G) or did not (F) conflict 
with the polarity domain. Right: three examples of ACDs from their respective 
classes with associated Aé°s. The black dotted line marks the calculated shortest 
wall, and the magenta arrow indicates the polarity domain. Scale bar, 5 wm. 

P < 0.0001, Kolmogorov-Smirnov test comparing the A@°s distributions in the 
two ACD classes. (H) Quantification of Ae° during presumed meristemoid 
divisions in bas/. P = 0.0005, Kolmogorov-Smirnov test with WT ACDs. 

n = 94 cells. (1) Examples of three early-lineage divisions in bas! with associated 
A@°s. The black dotted line marks the predicted shortest wall. Scale bar, 5 um. 
(J) Representative images of a medial (left) and maximum projection (right) 
ew of preprophase band (PPB) placement relative to the polarity domain in 
RXL2p::BRXL2-YFP TMMp::mCherry-TUAS5. Black arrows indicate the PPB. Scale 
ar, 5 um. (K) Reslice of the cell in (J) showing microtubule and BRXL2 
stribution along the anticlinal wall. Black arrows indicate the PPB. Scale bar, 
um. (L) Line scan along the cell cortex of the asymmetrically dividing cell 
hown in (J). The arrows indicate the position of the PPB. (M) Microtubule 
stribution (gray lines) in PPB-forming cells (n = 26 cells, average is shown in 
ack), aligned to the midpoints of the BRXL2 crescents (magenta line shows 
the mean + SD). 
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significantly from the shortest distance (fig. S3). 
In this second class, preprophase bands did 
not align with the predicted shortest wall 
when they bisected the polarity domain, and 
polarity was required for preprophase band 
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realignment away from the shortest wall (fig. 
S3). Together, these data indicated that the 
BASL/BRXf polarity crescent might orient 
divisions by controlling preprophase band 
placement. 
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To test this hypothesis, we generated 
lines to monitor BRXL2 inheritance in the 
trm678 mutant (trm678 BRXL2p::BRXL2-YFP 
MLIp::mCherry-RCI2A), which does not form 
preprophase bands (24). In contrast to wild-type 
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asymmetric divisions, in which BRXL2 was 
inherited by a single daughter cell, new cell 
walls frequently bisected the polarity site in 
trm678 (32% of divisions) (Fig. 2, A and B), 
showing that the preprophase band is re- 
quired to ensure complete inheritance of po- 
larized BASL/BRXf. Next, we monitored cell 
fate outcomes following incorrect BASL/BRXf 
inheritance by tracking progression through 
the stomatal lineage using a fluorescent re- 
porter of MUTE (25), a transcription factor 
that establishes guard mother cell identity. 
trm678 asymmetric divisions in which BRXL2 
was correctly inherited by a single daughter 
cell showed normal lineage progression. MUTE 
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t= -2:00hr 


Col-0 


trm678 


Cc BRXL2p::BRXL2-YFP D 


MUTEp::MUTE-YFP 
ML1p::mCherry-RCI2A 


expression was detectable in the smaller cell 
after the division, and all tracked MUTE’ cells 
in trm678 became paired guard cells (fig. S4). By 
contrast, t7m678 cells in which cortical BRXL2 
was bisected by the nascent division plane 
tended to generate daughters that both in- 
herited cortical BRXL2, never transitioned to 
MUTE’ cells, and failed to become pavement 
cells (Fig. 2C). In agreement with these track- 
ing data, 7 d postgermination (dpg) t7m678 
cotyledons had fewer stomata and a mispat- 
terned epidermis (Fig. 2, D to F). We did not 
observe any alterations in the relationship be- 
tween microtubules and BRXL2 in interphase 
trm678 cells (fig. S4, D to F). Therefore, we 
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Fig. 2. Singular inheritance of the BASL/BRXf polarity domain through preprophase band 
placement is required for stomatal patterning. (A) Stills from representative time-lapse movies 
showing cell division orientation in Col-O (top) and trm678 (bottom) relative to the BRXL2-YFP-marked 
cortical polarity domain. Scale bars, 10 um. (B) Quantification of Col-O (n = 112) and trm678 (n = 116) 


ACDs with singular inheritance (BRXL2 inherited by only one daughter) or double inheritance (BRXL2 
inherited by both daughter cells). (©) Representative time-course images with associated cartoons 


showing an instance of double inheritance and cell fate stalling 62 hours later from a trm678 seedling. 


Scale bar, 25 um. (D) Representative images of the epidermis of 7 dpg Col-O and trm678 cotyledons 


expressing MLlp::mCherry-RCI2A. Stomata are pseudocolored purple. Scale bars, 25 um. (E) Quantification 
of stomatal densities (number of stomata per 581.82 um x 581.82 wm area) in 7 dpg Col-O and trm678 


cotyledons (35 seedlings each). P = 0.0002, unpaired t test. (F) Quantification of the percentage of 


paired stomata per 581.82 um x 581.82 wm area in 7 dpg Col-O and trm678 cotyledons (35 seedlings 


each). P = 0.0012, unpaired t test. 
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conclude that the preprophase band serves as 
an essential link between the polarity domain 
and division orientation to regulate stomatal 
identity. 


Cortical BASL domains are locally depleted of 
stable microtubules 


How does the BASL/BRXf crescent influence 
preprophase band establishment? By creating 
a stomatal lineage-specific microtubule reporter 
line (TMMp::mCherry-TUAS), we could analyze 
microtubules and the BRXL2 polarity domain 
along anticlinal walls with high resolution 
(Fig. 3A). Unexpectedly, anticlinal microtubules 
were strongly depleted from the plasma mem- 
brane within the polarized domain, even in 
interphase SLGCs (Fig. 3, A to C). We con- 
firmed that the same microtubule depletion 
zone occurred when using a second polarity 
reporter, BASL, and in stomatal lineage cells 
of true leaves (fig. $5). POLAR, which shows | 
overlapping but distinct localization from 
BASL/BRXf (72), co-localizes with microtu- 
bules outside of the BASL/BRXL2 domain (fig. 
85), indicating that microtubule depletion is 
correlated specifically with BASL/BRXf and 
is not a generalized activity of polarized pro- 
teins in the stomatal lineage. 

BASL/BRXf could either locally deplete 
microtubules or opportunistically polarize to 
already microtubule-poor regions. To distin- 
guish between these possibilities, we per- 
formed two analyses. First, we examined 
cortical microtubule distribution before BASL 
polarization and found no microtubule- 
depleted region (fig. S6). Second, we com- 
pared microtubule distribution in wild-type 
and polarity-defective SLGCs and found that 
microtubule distribution was more homoge- 
neous in basl and brx-quad (11), which abrogate 
polarity, and in a line in which addition of a 
myristoylation signal [BASLp::MYR-BRX-YFP 
(11)] renders BRX localization largely uniform 
at the cortex (Fig. 3, D to G, and fig. S7). We 
also followed unmanipulated SLGCs as they 
lost polarized BASL/BRXf several hours after 
cell division. Our time course analysis showed 
that anticlinal microtubules reappeared within 
previously polarized regions in mature SLGCs 
(fig. S7E). From these data, we conclude that 
BASL/BRXf polarity creates and is required to 
maintain local microtubule loss at the plasma 
membrane. 

Mutual inhibition by opposing plasma 
membrane-associated domains can drive 
polarization, as in the conserved PAR net- 
works in animals (26) or in a recently described 
polarity system in the monocot Brachypodium 
distachyon (27). Because our data raised the 
possibility that microtubules and cortical BASL/ 
BRXf could operate in an analogous manner 
and inhibit the spread of each other, we tested 
whether altering microtubule distribution 
affected the stomatal lineage polarity domain. 
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Fig. 3. The BASL/BRXL2 
domain locally depletes corti- 
cal microtubules from the 
plasma membrane. (A) Top: 
representative medial section of a 
polarized SLGC and associated 
meristemoid in BRXL2p::BRXL2- 
YFP TMMp::mCherry-TUAS. 
Bottom: reslice showing micro- 
tubule and BRXL2 distribution 
along the anticlinal face of the 
same cell. Scale bars, 5 wm. 

(B) Line scan along the cell 
periphery of the SLGC shown in 
(A) showing the normalized BRXL2 
and TUA® fluorescence intensities. 
(C) Microtubule distribution in 
SLGCs (gray lines) (n = 72 cells) 
aligned to the midpoints of the 
BRXL2 crescents. Magenta line 
corresponds to the mean signal + 
SD. Black line is the average of the 
plotted microtubule signals. é 
(D) Top: representative medial 
section of stomatal lineage cells in 
bas! TMMp::YFP-TUAS. Bottom: 
reslice showing microtubule 
distribution along the anticlinal face 
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of the same cell. Scale bars, 5 um. (E) Line scan along the cell cortex of the larger daughter cell in (D) showing the normalized TUA5 fluorescence intensity. (F) Microtubule 
distribution in bas! SLGCs (gray lines) (n = 58 cells). (G) Asymmetric microtubule depletion scores calculated from integrated fluorescence intensities in polarized and 
unpolarized cortical domains (see the materials and methods) in wild-type (n = 50), bas! (n = 40), brx-quad (n = 31), and BASLp::MYR-BRX (n = 22) SLGCs. ***P < 0.0001, 


one-way ANOVA with Tukey's post hoc test. 


In agreement with previous results (28, 29), we 
found that microtubules were not necessary for 
the formation of a polarized BASL/BRXf domain 
(fig. S8A). However, quantification revealed a 
slight but significant spread of the polarity do- 
main along the anticlinal wall in the absence of 
microtubules (fig. S8B). Short plasmolysis treat- 
ments, which substantially disrupt cortical micro- 
tubule distribution, similarly altered polarity 
boundaries and polarity domain size without 
complete depolarization (fig. S8C). Therefore, 
microtubules shape BASL/BRXf domain boun- 
daries but are dispensable for polarity itself. 
This interaction shares marked similarities 
with microtubule-mediated sculpting of ROP 
GTPase domains in nondividing cells of the 
xylem and in trichomes (30-32). 


Microtubule dynamics are locally altered 
within polar domains 


How do cortical BASL/BRXf locally deplete 
cortical microtubules? Because of the technical 
challenges associated with monitoring dynamic 
microtubule behavior along the anticlinal walls 
of meristemoids, we created a heterologous 
system in which we could track microtubule 
dynamics coincident with the BASL polarity do- 
main. By introgressing a ubiquitous microtubule 
reporter, 35Sp::mCherry-TUAS, into a line express- 
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ing a hyperactive version of BASL capable of 
rescuing the bas/ phenotype, 35Sp::GFP-BASL-IC 
[hereafter referred to as BASL°?* (70)], we 
could monitor microtubule organization relative 
to BASL in the hypocotyl epidermis. BASL°°P* 
locally depleted microtubules along anticlinal 
walls in the hypocotyl epidermis, as in the 
stomatal lineage (fig. S9), demonstrating that 
this ectopic system recapitulates the molecular 
interactions found in the leaf epidermis. 
BASL*°P!° domains extended onto the 
apical surfaces of hypocotyl epidermal cells 
and locally depleted cortical microtubules 
(Fig. 4, A and C), allowing us to observe the 
BASL-mediated effects on microtubules with 
a precision not possible within the stomatal 
lineage. Increased microtubule severing has 
been identified as a key reorganizer of cortical 
microtubule arrays during several develop- 
mental transitions (33, 34). However, because 
severing preferentially occurs at microtubule 
crossover sites (35, 36) and there were few 
crossovers in microtubule-depleted BASL°P° 
regions, severing was largely suppressed 
within BASL°* domains (fig. $10, A and B). 
Tracking of microtubule minus ends within 
BASL*°P also indicated that local microtubule 
depletion was not caused by decreased minus- 
end stability (fig. S10C). Instead, we found that 


BASL°°P'* had two effects on microtubule 
plus ends. First, plus-end polymerization and 
depolymerization rates were significantly sup- 
pressed within BASL°°* (Fig. 4, E and F, and 
fig. S10D). Second, we observed that micro- 
tubule plus ends rapidly underwent catas- 
trophe upon entering the BASL°“°?* domain 
(Fig. 4, B and D). Increased catastrophe rates 
often led to complete loss of the microtubule, 
reestablishing the microtubule depletion zone. 

We used two independent approaches to 
validate that our BASL™°?° findings in the hypo- 
cotyl reflect BASL-microtubule interactions 
within the stomatal lineage. First, we performed 
time-lapse imaging of stomatal progenitors 
in BRXL2p::BRXL2-YFP TMMp::mCherry-TUA5 
seedlings and observed transient microtubules 
within the polarity domain that were rapidly 
depolymerized (fig. S11A). Second, to monitor 
growing plus ends with higher precision along 
the anticlinal wall, we generated a stomatal 
lineage-specific END BINDING PROTEIN 1b 
(EBIb) reporter, TMMp::EB1b-mCherry, and 
introgressed it into the BRXL2 reporter line. 
Fewer EBib puncta were observed within the 
native polarity domain in SLGCs than in non- 
polarized regions of the same cells (fig. S11, 
Bto D). Therefore, our analyses of microtubule 
dynamics in native and heterologous systems 
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Fig. 4. Polarized BASL destabilizes microtubule plus ends to locally bias 
array organization. (A) Representative images of the apical surfaces in 
epidermal cells of 35Sp::GFP-BASL-IC 35Sp::mCherry-TUA5 hypocotyls. The 
boxed regions below highlight the microtubule organization within the polarized 
domains of two cells. Scale bars, 10 um. (B) Kymographs showing microtubule 
plus-end dynamics within a nonpolarized region (left) and an ectopic polarity 
domain (right). Asterisks indicate microtubule catastrophes and rescues. Scale 
bar, 1 um. (C) Local microtubule depletion within apical BASL°*?"° domains and 


reveal that the BASL/BRXf domain destabil- 
izes microtubule plus ends to locally deplete 
them from the polarized region. 


Discussion 


As encoders of spatial information, polarity 
domains are central regulators of asymmetric 
cell division in diverse plant tissues (37-40). 
Here, we provide evidence that the BASL/BRXf 
polarity domain robustly serves dual functions 
to orient asymmetric divisions and specify cell 
identity by controlling its own inheritance 
through negative interactions with the micro- 
tubule cytoskeleton. Polarity-microtubule inter- 
actions now emerge as a common theme to 
guide asymmetric inheritance of fate regulators 
during both metazoan and plant asymmetric 
divisions, albeit through fundamentally differ- 
ent mechanisms (fig. S12). In the canonical 
asymmetric cell division pathway in animal 
cells, the cortical polarity domain is respon- 
sible for localizing fate determinants to one 
pole and subsequently directing the division 
angle to ensure their singular and asymmetric 
inheritance by exerting pulling forces on as- 
tral microtubules (41, 42). The model we ad- 
vance here differs in several significant ways. 
First, the proposed mechanism uses core 
plant-specific mitotic structures without the 
need to invoke a role for astral microtubules, 
which are absent in plant spindles. Second, 
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rather than ensuring its singular inheritance 
by precisely specifying the ultimate division 
plane, BASL/BRXf renders a region of the 
membrane unavailable as the division site. 
Third, this mechanism is uniquely suited to 
ensure asymmetric inheritance of key fate 
regulators within a morphologically heteroge- 
neous population. 

How does BASL-mediated polarity modu- 
late microtubule dynamics? BASL and BRXf 
proteins both contain large, disordered re- 
gions (43), leading us to favor two general 
models. In the first, BASL and BRXf scaffold 
effectors that directly affect plus-end kinetics 
and potentially bind along the microtubule 
lattice to affect depolymerization rates. From 
our analyses in this work, we anticipate that 
such microtubule-associated effectors would 
be expressed throughout the cell cycle and in 
multiple tissues, which has complicated our 
ongoing efforts to identify them. In the sec- 
ond model, which does not invoke additional 
downstream factors, polarization through phase 
separation could tune the local physical prop- 
erties of the membrane-adjacent cytoplasm. 
Such a mechanism can modulate microtubule 
dynamics in yeast (44) and might be hinted at 
by the dampening of microtubule dynamics at 
the polarity zone (fig. S10D). 

Of the documented, polarity-mediated asym- 
metric divisions in plants, those that violate the 


E F RK 

. 3.6 41.2 304 12.4454 
= € 
< 28412 €£ 
E 6 & 
=) > 20 
2 4 i 
e 7 6.1+3.1 
o 4 £ 
g A 
= 5 10 
2 £ 
= 2 3s 
& o 

a 
0 
NP P 


comparably sized random regions in control hypocotyls. The local microtubule 
depletion was derived using 35Sp::mCherry-TUA5 fluorescence (see the 
materials and methods). n = 25 hypocotyl cells for each. P < 0.0001, unpaired 
t test. (D to F) Quantification of microtubule plus-end dynamics within 
nonpolarized (NP) and polarized (P) apical domains. Growth persistence (D), 
polymerization rate (E), and depolymerization rate (F) were quantified. The 
numbers above the box-and-whisker plots are mean values + SD. For all 
comparisons, unpaired t tests were used (P < 0.0001). 


shortest wall rule, such as those in the early 
Arabidopsis embryo or subsidiary mother cell 
divisions in Zea mays, may be the closest cor- 
ollaries to the system presented here. Although 
BASL is both eudicot (43) and stomatal lineage 
specific, BRX family proteins participate in 
additional cellular decisions in Arabidopsis and 
are much more deeply conserved in the green 
lineage (45), hinting that other tissue-specific 
regulators could provide context specificity to 
a common polarity core. Further analysis of 
polarity will help to clarify whether this mech- 
anism is shared across plant tissues and species 
or if it has evolved for the challenges associated 
with flexible patterning in the eudicot stomatal 
lineage. 
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Interface passivation for 31.25%-efficient 
perovskite/silicon tandem solar cells 
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Silicon solar cells are approaching their theoretical efficiency limit of 29%. This limitation can be 
exceeded with advanced device architectures, where two or more solar cells are stacked to improve the 
harvesting of solar energy. In this work, we devise a tandem device with a perovskite layer conformally 
coated on a silicon bottom cell featuring micrometric pyramids—the industry standard—to improve its 
photocurrent. Using an additive in the processing sequence, we regulate the perovskite crystallization 
process and alleviate recombination losses occurring at the perovskite top surface interfacing the 
electron-selective contact [buckminsterfullerene (Cgo)]. We demonstrate a device with an active area of 
1.17 square centimeters, reaching a certified power conversion efficiency of 31.25%. 


he deployment of photovoltaics (PV) can 

be accelerated by maximizing the elec- 

trical power produced per unit area be- 

cause the cost distribution of a PV system 

is now dominated by the balance of 
system components (such as the mounting 
system, wiring, manpower, and inverters) 
rather than by the cost of the PV panels. 
This balance of system costs scales roughly 
with the installed area and favors PV tech- 
nologies with a high power-to-panel area 
ratio. However, the record power conversion 
efficiency (PCE) of crystalline silicon (c-Si) 
solar cells of 26.8% (1) is approaching the 
theoretical limit of 29.5% (2). The only ex- 
perimentally validated approach to overcome 
this PCE limitation under 1-sun illumination 
conditions consists of combining several com- 
plementary photoactive materials (meaning 
multiple junctions) in a single device (3). Among 
the different types of multijunction designs 
reported to date, the combination of c-Si with 
a metal halide perovskite in a tandem solar 
cell has been the focus of intense research 
efforts given the potential for both high PCE 
and low manufacturing costs (4). 
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Metal halide perovskites combine several 
key properties for effective multijunction PV, 
including a high absorption coefficient with a 
sharp absorption edge (5), ambipolar charge 
transport with long diffusion lengths (6, 7), 
and a compositionally tunable bandgap en- 
ergy (E,) (8). A thin-film perovskite solar cell 
can be deposited directly on the front side of 
a c-Si cell to lower thermalization losses and 
extend the range of achievable PCE to >30% 
(3). The performance potential of monolithic 
two-terminal tandem architectures is illus- 
trated by reported PCEs as high as 33.7% for 
1-cm? illuminated areas (1). Most of the high- 
efficiency tandem cells reported to date use a 
Si wafer in which the front surface is mech- 
anically or chemically polished (9) or features 
an adapted submicrometric texture smaller 
than the perovskite layer thickness (typically 
from 500 nm to 1 um) (J0-13). This planar or 
nanotextured front-side topography—wafers 
used in the PV industry are usually etched to 
form pyramids of a few micrometers in height— 
enables the deposition of pinhole-free perov- 
skite films by standard solution-based pro- 
cesses. However, this modification comes at 
the expense of optical performance because 
the front side of the tandem is flat, and also 
when using a submicrometric Si texture, be- 
cause the latter is planarized by the non- 
conformal solution-processed perovskite film. 
Consequently, these cell designs exhibit more 
reflection losses at the front of the tandem 
because of an absence of double-bounce effects 
(14). Overall, a pyramidal texture at the front of 
the tandem device limits reflection losses 
because it enables the absorption of reflected 
light in neighboring pyramids, whereas the 
presence of texture on both sides of the Si wafer 
improves the trapping of infrared light (75). 

We previously reported a hybrid two-step 
deposition method combining thermal evapo- 
ration and spin coating to conformally coat 


0 


Chec 


the perovskite layer on micrometric Si fF ae 
mids, resulting in perovskite/c-Si tandem... 
which both rear and front sides were textured 
(16). Although these tandem cells had a high 
photocurrent thanks to the front-side pyrami- 
dal texture, nonradiative recombination losses 
were substantial. One challenge is that most 
top-surface passivation approaches reported 
to date are not directly applicable to micro- 
metric textures because they involve depositing 
nanometric organic layers from liquid solutions 
(77-19). These processing routes usually yield 
nonconformal (incomplete) coatings on such 
surface textures. Building on our previous 
work (16), we demonstrate enhanced tandem 
performance using phosphonic acids in two 
different roles to passivate interfacial defects: 
first, as a hole-transport layer (HTL), [4-(3,6- 
dimethyl-9H-carbazol-9-yl)butyl phosphonic acid 
(Me-4PACz) (9) and second, as a perovskite ad- 
ditive in the form of 2,3,4,5,6-pentafluoro- 
benzylphosphonic acid (FBPAc) (20). Together 
with a micrometric texture standard in the c-Si 
PV industry and an optimization of the layers 
located at the front of the tandem for further 
optical gains, these passivation strategies 
mitigate voltage losses and result in tandem 
device efficiencies >30%, with an accredited ‘ 
value of 31.25%. 


Identification and mitigation of voltage losses 
in perovskite top cells r 


To assess loss pathways triggered by non- 
radiative recombination, we measured the 
photoluminescence quantum yield of different 
layer stacks representative of our p-i-n perov- 
skite solar cells used in tandem cells. We 
quantified the internal voltage potential (in 
relative terms), or quasi-Fermi-level splitting 
(QFLS), created within the cesium formamidi- 
nium lead iodide bromide [CsojgFAp.goPb(1, 
Br)3] absorber under light excitation (Fig. 1A). + 
Flat glass/indium tin oxide (ITO) substrates ‘ 
were first coated with a HTL, either 2,2’,7,7'- 
tetra(N,N-di-tolyl)amino-9,9-spiro-bifluorene 
(spiro-TTB), N4,N4,N4”,N4”-tetra([1,l-biphenyl]- . 
4-yl)-[1,7:4,1’-terphenyl]-4,400-diamine (TaTM), 
or Me-4PACz (fig. S1). These three materials 
were selected on the basis of their compati- 
bility with textured surfaces. The first two 
can be thermally evaporated, and the latter can 
self-assemble on ITO to yield a conformal coat- 
ing on textured surfaces (27). We then de- 
posited a perovskite thin film using a hybrid 
two-step deposition method (16). Specifically, 
a CsBr/PbI, template was coevaporated on either 
flat glass substrates or glass/ITO/HTL stacks 
before spin coating an organo-halide solution 
consisting of FABr and FAI and, in some cases, 
with FBPAc. A thermal annealing step in an 
ambient atmosphere at 150°C converted the 
films into a perovskite phase (see the experi- 
mental section for details). In view of making 
tandems, a perovskite composition with an 
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optical E, of ~1.70 eV was selected (fig. S2). 
Some of these stacks were then completed by 
a Cgo electron-transport layer (ETL) thermally 
evaporated in high-vacuum conditions. 

The QFLS of bare perovskite layers depo- 
sited on glass (left column) remains constant 
when incorporating FBPAc (Fig. 1A). When 
inserting the HTL (Fig. 1A, hollow diamond), 
the addition of FBPAc increases the QFLS 
slightly with all three HTLs (+20 meV). Overall, 
QELS losses relative to the glass/perovskite 
reference are highest on spiro-TTB (—80 and 
-45 meV compared with the glass/perovskite 
reference, without and with FBPAc, respec- 
tively), followed by TaTm (—65 and —-40 meV), 
and lowest on Me-4PACz (—25 and -10 meV). 
These differences suggest improved surface 
passivation with Me-4PACz (9). Without 
FBPAc, the addition of the Cgo ETL (Fig. 1A, 
filled diamond) on the perovskite leads to 
substantial losses in QFLS (—100 to -50 meV 
in addition to the HTL-induced losses) caused 
by the formation of trap states in the first 
monolayer of Cg (22). On the other hand, 
the incorporation of FBPAc in the perovskite 
film synthesis mitigates some of these ETL- 
induced losses, resulting in a +100-meV QFLS 
gain compared with FBPAc-free films. 

Overall, FBPAc suppresses Pb?*/Pb° sur- 
face defects, as shown by x-ray photoelectron 
spectroscopy (XPS) in Fig. 1B and figs. S3 to S5. 
The phosphonic group of FBPAc coordinates 
with Pb-related defects as also demonstrated 
elsewhere (20, 23, 24), effectively lowering the 
population of top-surface nonradiative recom- 
bination states (25). Furthermore, the positive 
impact of FBPAc is mainly observed when Cgo 
is present: FBPAc likely separates the perov- 
skite from the Co layer, reducing the popula- 
tion of deep trap states forming in Cg when it 
is in direct contact with the perovskite (11, 22). 
It is important to note that some effects may 
affect the QFLS data presented in this work. 
First, the presence of oxygen and humidity 
during the annealing step and subsequent 
photoluminescence measurements may lead 
to a passivation of Pb-related defects (26). 
Second, x-ray diffraction (XRD) patterns show 
that the addition of FBPAc decreases the 
fraction of PbI, crystalline phases remaining 
at the end of the crystallization process (Fig. 1C 
and fig. S8)—an effect that should lower the 
photoluminescence quantum yield and hence 
the QFLS of films featuring the additive (27). 
Other effects are less likely to explain the QFLS 
trends we observed. For example, FBPAc and 
Cgo do not appear to chemically interact based 
on additional XPS data (fig. S6). Also, the 
electronic band structure of uncoated perovskite 
films treated with the additive shifts by ~0.5 eV 
because of the strong dipole moment of FBPAc 
(fig. S7), in line with literature (28). However, 
FBPAc is unlikely to lead to such a conduction 
band level offset in full devices considering the 


Chin et al., Science 381, 59-63 (2023) 7 July 2023 


A [wo FBPAcT_w FBPAc 


-100 


QELS loss (meV) 


a 
a 
o>) 


B 
Ss 
Ss 
4 w FBPAc 
no 
| 4 
2 
£ 
no 
a 
x< 
w/o FBPAc 
146 144 142 140 138 1 


binding energy (eV) 


w/o FBPAc| wFBPAc | w/o FBPAc| wFBPAc | w/o FBPAc | w FBPAc 


Me-4PACz 


XRD counts (a.u.) 


30 


40 
20 (°) 


50 


Fig. 1. Reduced nonradiative recombination losses and improved crystallographic properties 
when adding FBPAc. (A) QFLS loss of perovskite films (without and with FBPAc) extracted from 


photoluminescence quantum yield data relative to the 


glass/perovskite reference. Hollow diamond data 


points correspond to the QFLS of perovskite films deposited on different HTLs (spiro-TTB, TaTm, and 


Me-4PACz), and filled diamonds relate to films sandwi 
(B and C) High-resolution XPS spectra of the Pb 4f o 
with and without FBPAc (C). a.u., arbitrary units. 


improved solar cell performance achieved with 
FBPAc (see below). 


Modification of the perovskite crystallization 
kinetics and resulting microstructure 


To understand the impact of FBPAc on the 
perovskite crystallization process, grazing- 
incidence wide-angle x-ray scattering (GIWAXS) 
was performed in situ at 150°C starting from 
a CsBr/PbIy template freshly spin-coated with 
FAI/FABr (Fig. 2A and fig. $8). Although the 
crystallization onset of the perovskite (110) 
and (002) planes (q ~ 10.1 nm’) (29) is similar 
with or without the additive, FBPAc slows down 
the perovskite crystallization rate 100 s into 
the experiment (Fig. 2A). The intensity of these 
perovskite reflections then continues to in- 
crease throughout the experiment up to ~400 s— 
an effect not observed without the additive. 
Without FBPAc, the peak intensity stabilizes 
after 150 s and remains at a lower value. This 
modification of the crystallization pathway by 
FBPAc is also observed when analyzing PbI. 
peaks (Fig. 2A). As in Fig. 1C, their evolution 
confirms a more-complete conversion of the 


CsBr/PbI, template to a perovskite phase when 


iched by one of these HTLs and a C¢o ETL. 


bital (B) and XRD patterns of perovskite films 


using FBPAc. Overall, this two-step and glob- 
ally slower crystallization kinetics with the 
additive likely originates from the interaction 
between Pb** and FBPAc (20, 23, 24). FBPAc 
slows down the conversion of the CsBr/PbI, 
template to the perovskite phase because of 
competition between FBPAc and other perov- 
skite precursors (namely FAI and FABr) for a 
coordination with Pb”* (25, 30). Advantageously, 
this difference in formation kinetics results in 
films with apparently larger and more clearly 
defined perovskite domains because FBPAc may 
inhibit the nucleation of new perovskite grains 
or domains, as shown by top-view scanning 
electron microscopy (SEM) images (Fig. 2B). 
Submicrometer dark regions are observed 
on perovskite films featuring FBPAc (Fig. 2B). 
From gas-field ion-source helium and neon 
ion microscopy coupled with secondary-ion 
mass spectrometry (SIMS) (Fig. 2C and fig. S9) 
(31), these clusters are found to be rich in F 
and C, which is indicative of the clustering of 
the FBPAc excess on the perovskite surface. 
Films processed without FBPAc do not show 
any clustering and feature a lower amount of 
F and C on their surface (Fig. 2, C and D, and 
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Fig. 2. Impact of FBPAc on the perovskite crystallization and resulting microstructure. (A) GIWAXS 
data showing the evolutions of the (110) and (002) planes of the perovskite phase (indicated as PK in the 
figure) and of the (001) planes of the Pblz phase during the perovskite crystallization process at 150°C. 
(B) Secondary-electron SEM images highlighting the difference in surface morphologies when adding FBPAc, 
with arrows showing FBPAc clusters. (C) SIMS maps of the !9F~ and '*C2" signals. The bright and dark 


regions represent higher and lower, respectively, signal counts. (D) Corresponding histograms of the 


and /C2 SIMS signals shown in (C). 


fig. S9). Depth profiling combined with XPS 
reveals that FBPAc is confined to the top perov- 
skite surface (figs. S10 to S13). The bulky FBPAc 
molecules are expelled toward the top surface 
during the perovskite crystallization process. 

The large reduction in QFLS losses suggests 
that FBPAc is also present in between the 
surface clusters. This hypothesis stems from 
the logarithmic dependency between QFLS and 
surface recombination current (and hence non- 
passivated surface fraction, presuming that 
FBPAc is reducing the surface recombination 
current by a few orders of magnitude). If FBPAc 
was present only at the position of the clusters 
detected by SIMS, meaning that ~90% of the 
film surface would be free of FBPAc, the dif- 
ference in QFLS would be limited to ~3 meV 
[AQFLS = 25.7 meV x In(1/0.9) = 3 meV]. We 
observe a gain in QFLS of >50 meV when Cgo 
is present (Fig. 1A), highlighting that regions 
between the clusters also feature FBPAc. 
Overall, FBPAc passivates Pb-related defects 
and mitigates the formation of trap states in 
the Cgo layer by segregating on the perovskite 
top surface, with the excess forming clusters 
on the surface. 
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Perovskite single junctions and perovskite/c-Si 
tandem solar cells 

These perovskite films were incorporated 
into ~0.49-cm” glass/ITO/HTL/perovskite/ 
Cgo/bathocuproine (BCP) (or SnO,)/Cu (or Ag) 
single junctions to validate the improvements 
induced by FBPAc with current-voltage (J-V) 
measurements (1000 W/m?, AM1.5G spectrum, 
masked with a 0.25-cm” aperture; fig. S14). 
The open-circuit voltage (Vo) of the devices 
improves by 20 to 30 mV when switching 
from spiro-TTB to TaTm and Me-4PACz and 
by an extra 40 to 50 mV when adding FBPAc, 
in line with the QFLS results of Fig. 1A. The 
corresponding fill factor (FF) also increases 
on average from ~69 to ~76% when changing 
the HTL and then to ~78 to 81% with the 
additive, confirming that the molecule does not 
lead to the formation of a barrier impeding 
electron extraction. Overall, the combination 
of Me-4PACz and FBPAc results in a PCE of up 
to 19.5%. Varying the concentration of FBPAc 
from 2.5 to 7.5 mM in the FABr:FAI solution 
does not affect the performance substantially 
(fig. S15), which suggests that the excess of 
FBPAc (Fig. 2B) does not hinder charge trans- 


port across the perovskite/Cg, interface. Wash- 
ing away the FBPAc with an orthogonal solvent 
mixture results in perovskite cells with a simi- 
lar V,, to that of untreated films, which con- 
firms that FBPAc affects primarily recombination 
at the perovskite top surface (fig. S16). 

We then adapted the perovskite deposition 
conditions to produce 1-cm? tandem cells on 
silicon heterojunction bottom cells that had 
pyramids of 2 to 3 um on both sides of the 
wafer (Fig. 3A) (32, 33). Compared with our 
previous tandem baseline (16), the perovskite 
layer was made thicker (from ~650 nm to 
~1 um) to reach current matching conditions 
with an E, of ~1.7 eV (1.63 eV previously; Fig. 
3B). For that purpose, we deposited a CsBr/ 
PbI, template of 660 nm and adapted the 
spin-coating parameters to convert the tem- 
plate into a perovskite phase. The front elec- 
trode was made transparent by switching to 
a stack made of Cgo, SnO,, indium zinc oxide | 
(IZO), an Ag grid, and a MgF, antireflective 
coating (see materials and methods and sup- 
plementary text for details). 

A V,, of 1.9 V is obtained with either TaTM 
or Me-4PACz (fig. S17), which is consistent 
with single-junction cell results (perovskite 
top cells in fig. S14; c-Si bottom cell in fig. $18). 
The FF of TaTm-based tandems is lower than 
with Me-4PACz (<70% compared with >75%), 
likely because of the low hole mobility in 
undoped TaTm (34). We also investigated the 
impact of a slight change in the perovskite FE, 
by changing the FABr:FAI molar ratio from 
2 to 1.75 and 1.5 (corresponding to a E,. of 1.69, 
1.68, and 1.67 eV, respectively). A PCE >30% 
could be achieved with several of these perov- 
skite compositions, with slight variations in 
J,. and FF values depending on the top cell 
£, and the current-limiting subcell (figs. $17 
and S19). These tandems remained stable for 
>360 days when stored at room temperature in 
No (fig. S20). A selection of these unencap- 
sulated cells were then operated in ambient at 
their maximum power point at 65°C under a 
1-sun illumination (~20 to 30% relative hu- 
midity; ISOS-L2 protocol). The most stable 
device reaches a tgo of ~66 hours until 80% of — 
its initial PCE is measured (figs. S21 and S22). 
The degradation rate at 25°C of the FBPAc- 
treated tandem stack is similar to that achieved 
without FBPAc in our previous study (ISOS-L1 
conditions; figs. S21 and S22) (16). 

A tandem cell featuring Me-4PACz and 
FBPAc was sent to the National Renewable 
Energy Laboratory (NREL) for certification. 
The certified relative external quantum effi- 
ciency (EQE) shown in Fig. 3C indicates that 
the tandem was slightly top cell limited and 
that both subcells produced a.J,. > 20 mA/ cm?. 
The stabilized J-V properties of the device were 
measured using the asymptotic maximum 
power (Pyax) scan method in standard test 
conditions (STC) (25°C, 1000 W/m?, AM1.5G 
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Fig. 3. 31.25%-efficient perovskite/c-Si thanks to the combined action of Me-4PACz and FBPAc 


and a pyramidal texture on the front side of the tandem. (A) Schematic view of the perovskite/c-Si 
tandem stack and corresponding picture of the perovskite top cell on textured Si. The acronyms a-Si:H 
and ne-Si:H stand for amorphous and nanocrystalline hydrogenated Si, respectively, and (i/n/p) indicates 
the doping of the layers. (B) Secondary-electron SEM images of the front side of the tandem. (C to E) 
Independently certified EQE (C), asymptotic maximum power (Pina) scan (D), and current density and power 


density tracking around Pmax (E). 


spectrum; Fig. 3, D and E, and figs. S23 to S26) 
(35). In agreement with data measured in- 
house, the cell reaches a Vo, of 1.91 V, a Jee of 
20.47 mA/cm”, and a FF of 79.8%, which re- 
sults in a certified PCE of 31.25% (with an 
aperture area of 1.1677 cm”). 


Conclusions 


We identified and mitigated nonradiative 
recombination losses occurring at the inter- 
faces of perovskite/c-Si tandems featuring Si 
wafers with a micrometric texture—the stan- 
dard used in the c-Si PV industry. The use 
of Me-4PACz reduces voltage losses at the 
perovskite/HTL interface, whereas the inclu- 
sion of FBPAc in the perovskite deposition 
sequence reduces the voltage losses at the 
perovskite/Cg, ETL interface and leads to 
more favorable perovskite microstructures 
with larger domains. From XPS and SIMS 
imaging, FBPAc is present on the perovskite 
top surface and coordinates with perovskite 
Pb-related defects through its phosphonic acid 
group. Overall, the combination of a c-Si micro- 
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metric texture to improve optical properties, 
a high-quality 1-um-thick perovskite absorber 
deposited conformally on this texture using a 
hybrid two-step method, and phosphonic 
groups at both sides of the absorber for im- 
proved interface passivation resulted in a 
tandem with an independently certified PCE 
of 31.25%. These results demonstrate how c-Si 
solar cells with a standard industrial micro- 
metric texture can be upgraded to increase 
their PCE to >30%. 


REFERENCES AND NOTES 


1. “Best Research-Cell Efficiency Chart” (NREL, 2023); 
https://www.nrel.gov/pv/cell-efficiency.html. 

2. S. Schafer, R. Brendel, IEEE J. Photovolt. 8, 1156-1158 
(2018). 

3. L.C. Hirst, N. J. Ekins-Daukes, Prog. Photovolt. Res. Appl. 19, 
286-293 (2011). 

4. F. Fu et al., Adv. Mater. 34, e2106540 (2022). 

5. S. De Wolf et al., J. Phys. Chem. Lett. 5, 1035-1039 (2014). 

6. X.Y. Chin, D. Cortecchia, J. Yin, A. Bruno, C. Soci, Nat. Commun. 
6, 7383 (2015). 

7. S. D. Stranks et al., Science 342, 341-344 (2013). 

8. T. Jesper Jacobsson et al., Energy Environ. Sci. 9, 1706-1724 
(2016). 

9. A. Al-Ashouri et al., Science 370, 1300-1309 (2020). 


0. B. Chen et al., Joule 4, 850-864 (2020). 

J. Liu et al., Science 377, 302-306 (2022). 

2. P. Tockhorn et al., Nat. Nanotechnol. 17, 1214-1221 

(2022). 

3. Y. Hou et al., Science 367, 1135-1140 (2020). 

A. F. Gota, R. Schmager, A. Farag, U. W. Paetzold, Opt. Express 

30, 14172-14188 (2022). 

5. D. A. Jacobs et al., J. Phys. Chem. Lett. 10, 3159-3170 

(2019). 

6. F. Sahli et al., Nat. Mater. 17, 820-826 (2018). 

7. C. M. Wolff, P. Caprioglio, M. Stolterfoht, D. Neher, Adv. Mater. 

31, e1902762 (2019). 

8. J. Peng et al., Adv. Energy Mater. 8, 1801208 (2018). 

9. R. Azmi et al., Science 376, 73-77 (2022). 

20. E. Akman, A. E. Shalan, F. Sadegh, S. Akin, ChemSusChem 14, 

176-1183 (2021). 

21. M. RoB et al., Adv. Energy Mater. 11, 2101460 (2021). 

22. J. Warby et al., Adv. Energy Mater. 12, 2103567 (2022). 

23. A. A. M. Brown et al., Nanoscale 11, 12370-12380 

(2019). 

24. J. S. Kim et al., Nature 611, 688-694 (2022). 

25. T. Wang, J. D. A. Ng, E. W. Y. Ong, K. H. Khoo, Z. K. Tan, 

Adv. Opt. Mater. 10, 2101914 (2021). 

26. J. F. Galisteo-Lopez, M. Anaya, M. E. Calvo, H. Miguez, J. Phys. 

Chem. Lett. 6, 2200-2205 (2015). 

27. J. Euvrard, O. Gunawan, D. B. Mitzi, Adv. Energy Mater. 9, 

902706 (2019). 

28. |. Lange et al., Adv. Funct. Mater. 24, 7014-7024 (2014). 

29. L. Oesinghaus et al., Adv. Mater. Interfaces 3, 1600403 

(2016). 

30. T. Jeon et al., Adv. Energy Mater. 7, 1602596 (2017). 

31. T. Wirtz, O. De Castro, J.-N. Audinot, P. Philipp, Annu. Rev. 
Anal. Chem. 12, 523-543 (2019). 

32. F. Sahli et al., Adv. Energy Mater. 8, 1701609 (2018). 

33. L. Antognini et al., IEEE J. Photovolt. 11, 944-956 
(2021). 

34. C. Momblona et al., Energy Environ. Sci. 9, 3456-3463 
(2016). 

35. T. Song, D. J. Friedman, N. Kopidakis, Adv. Energy Mater. 11, 

2100728 (2021). 


B 


ACKNOWLEDGMENTS 


The authors thank P. Wyss and C. Allebé for SHJ wet-chemical 
processing and V. Paratte, B. Kamino, T. Wirtz, J. Hofkens, and 
M. B. J. Roeffaers for their valuable scientific and technical 
discussion. The authors also thank the staff of the BL11 NCD- 
SWEET beamline at ALBA Synchrotron for their assistance in 
recording the GIWAXS data. Funding: This study was supported by 
Swiss National Science Foundation grant 200021_197006 (PAPET); 
Swiss Federal Office of Energy grant SI/501804 (INTENT); Swiss 
Federal Office of Energy grant SI/502209 (PRESTO); Luxembourg 
National Research Fund (FNR) grant PRIDE17/12246511 (PACE); 
and Research Foundation - Flanders FWO, grant no. 12Y7221N, 
V400622N. C.M.W. thanks the European Commission for funding 
through a Marie Sktodowska-Curie Fellowship (no. 101033077). 
Author contributions: Conceptualization: X.Y.C. and Q.J. 
Experiments: X.Y.C., D.T., J.A.S., S.T., S.E., M.M., P.F., C.M.W., A.P., 
K.A., D.J., Q.G., F.S., GA., M.B., and Q.J. Project administration: 
X.Y.C., C.M.W., G.A., M.B., Q.J., and C.B. Supervision: X.Y.C., C.M. 
W., Q.J., and C.B. Writing - original draft: X.Y.C. and Q.J. Writing - 
review & editing: X.Y.C., D.T., C.M.W., J.A.S., S.T., S.E., M.M., P.F., 
AP., K.A., DJ., Q.G., F.S., G.A., M.B., Q.J., and C.B. Competing 
interests: X.Y.C. and Q.J. are inventors on a provisional patent 
related to the subject matter of this manuscript (EP22210577.7). 
The authors declare no other competing interests. Data and 
materials availability: All data are available in the main text or the 
supplementary materials. License information: Copyright © 2023 
the authors, some rights reserved; exclusive licensee American 
Association for the Advancement of Science. No claim to original 
US government works. https://www.science.org/about/science- 
licenses-journal-article-reuse 


SUPPLEMENTARY MATERIALS 
science.org/doi/10.1126/science.adg0091 
Materials and Methods 

Supplementary Text 

Figs. S1 to S28 

Tables S1 to S4 

References (36-45) 


Submitted 27 November 2022; accepted 5 May 2023 
10.1126/science.adg0091 


4 of 4 


RESEARCH 


SOLAR CELLS 


Interface engineering for high-performance, 
triple-halide perovskite-silicon tandem solar cells 


Silvia Mariotti'}+, Eike Kéhnen’y, Florian Scheler’, Kari Sveinbjérnsson’, Lea Zimmermann’, 
Manuel Piot”, Fengjiu Yang’, Bor Li?, Jonathan Warby°, Artem Musiienko’, Dorothee Menzel’, 
Felix Lang®, Sebastian KeBler’, Igal Levine’, Daniele Mantione*®®, Amran Al-Ashouri_, 
Marlene S. Hartel’, Ke Xu?, Alexandros Cruz’, Jona Kurpiers?, Philipp Wagner’, Hans Kébler’, 
Jinzhao Li’, Artiom Magomedov’, David Mecerreyes”®, Eva Unger’, Antonio Abate’, 

Martin Stolterfoht®, Bernd Stannowski?, Rutger Schlatmann’, Lars Korte’, Steve Albrecht!** 


Improved stability and efficiency of two-terminal monolithic perovskite-silicon tandem solar cells will require 
reductions in recombination losses. By combining a triple-halide perovskite (1.68 electron volt bandgap) with 
a piperazinium iodide interfacial modification, we improved the band alignment, reduced nonradiative 
recombination losses, and enhanced charge extraction at the electron-selective contact. Solar cells showed 
open-circuit voltages of up to 1.28 volts in p-i-n single junctions and 2.00 volts in perovskite-silicon tandem solar 
cells. The tandem cells achieve certified power conversion efficiencies of up to 32.5%. 


Ithough the photovoltaic (PV) devices mar- 

ket is currently dominated by crystalline 

silicon (c-Si) devices, perovskite-silicon tan- 

dem architectures have gained attention 

to increase their power conversion effici- 
ency (PCE). Two-terminal devices that directly 
connect a c-Si bottom cell and perovskite top cell 
use light more efficiently because high-energy 
photons can be absorbed by the perovskite top 
cell, and low-energy photons transmitted by the 
perovskite can be absorbed by the c-Si bottom 
cell (1). This approach recently led to a PCE of 
33.7% (2). However, given that the detailed 
balance limit of monolithic perovskite-silicon 
tandems is 45.1% (3), further improvements 
could be made, especially concerning the open- 
circuit voltage (Voc) U, 4). To reach the highest 
possible efficiency under the air mass coefficient 
(AM) 1.5G solar spectrum and current-matching 
conditions, the optimum band gap Eg of the pe- 
rovskite is 1.73 eV (3). However, finite perovskite 
thickness and parasitic absorption reduce the 
optimal perovskite band gap to ~1.68 eV (5-8). 
Perovskite compositions with this band gap 
require high bromide contents and suffer from 
instability caused by phase segregation that 
lead to large Voc deficits (7). To address this 
problem, we combined an efficient perovskite 
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composition with an interface modification. 
Thorough analysis reveals the mechanism 
leading to high Voc-values and hence, high 
PCE-values in single-junction solar cells and 
perovskite-silicon tandem solar cells. 


Perovskite compositions and surface treatment 


A widely used perovskite composition is the “triple- 
cation” (3Cat) perovskite Cs995(FAo.7MAo23)o.95 
Pb(Io.77Bro.23)3, Which contains cesium (Cs), 
methylammonium (MA), and formamidinium 
(FA) as cations and mixed bromine (Br) and 
iodine (I) anions (7, 8). However, the so-called 
“triple-halide” (3Hal) composition Csg..0.FAo 7g 
Pb(Io.ssBro15)3 + 5% MAPbCl, substantially 
reduces the MA content, which is associated 
with instability, while increasing the amount 
of Cs and using three halides (adding Cl’ to 
and Br ) to improve the quality of the perovskite 
crystal lattice, stability (9, 10), and minimized 
phase segregation (17). In addition, the 3Hal 
has shown substantial improvement of quasi- 
Fermi level splitting (QFLS) and a low ideal- 
ity factor (12) compared with those of more 
common perovskite compositions, which are 
factors that generally determine high Vo¢ val- 
ues. By naming this perovskite 3Hal, we do not 
exclude the presence of three cations, but we 
emphasize the presence of three halides that 
confer the positive properties to this perov- 
skite. Previous reports showed that this 3Hal 
perovskite enables efficient tandem cells with 
a PCE of 27% (13) and has a pseudo-PCE 
[obtained with photoluminescence (PL) mea- 
surements] of 33.4% (versus 32.6% for tan- 
dems with 3Cat perovskite) (72). Thus, 3Hal 
was used for this study. 

Besides tuning the composition of the 
perovskite layers to improve performance, ad- 
ditives are particularly effective in passivat- 
ing electronic traps and eliminating current 
density-voltage (JV) hysteresis in devices (J4). 
Chemical passivation of the perovskite layer 


nm. 
can reduce nonradiative recombination Ic Chee 
that occur in the bulk of the film (particu\—_, 
at the grain boundaries) or at the interfaces with 
the charge transport layers (CTLs), which can 
improve device Voc and stability (5-17). We 
focused on piperazinium iodide (PI), an ionic 
liquid previously reported by Li et al. (18) for 
use with a FA-rich 3Cat perovskite. The syn- 
thesis and characterization of the PI molecule 
are presented in the supplementary materials 
(fig. $1). Li et al. showed that PI contains both 
electron acceptor (R2,NH,") and electron donor 
(R.NH) functional groups that can interact 
with both positive and negative perovskite 
surface traps caused by Pb”* and halide ions, 
reducing nonradiative recombination (78). In 
addition to the chemical passivation of the 
perovskite surface, Li et al. show that the 
formation of a dipole between the perovskite 
and the electron-transport layer (ETL) par- 
ticipates in reducing Voc losses to 330 mVin | 
p-i-n devices (78). 
We combined the 3Hal perovskite with 
a bandgap suitable for tandem integration 
(1.68 eV) with PI interfacial modification and 
compared it with the commonly used lithium 
fluoride (LiF) interlayer (19, 20). As do other 
alkali-earth metal fluorides, such as MgF», LiF ‘ 
acts as a contact displacer that improves the 
device performance by favorably adjusting the 
surface energy of the perovskite layer in con- 
tact with the fullerene-based ETL (Cgo) (17). By‘ 
contrast, PI shows a faster charge extraction and 
reduced nonradiative recombination losses 
and creates a more favorable band alignment 
between the perovskite and the Cg) ETL by form- 
ing a positive dipole, while no chemical passiva- 
tion of the perovskite surface is detected. This 
modification allows Voc values of up to 1.28 
and 2.00 V and PCE values of 21.5 and 32.5% 
for perovskite single-junction and perovskite- 
silicon tandem solar cells, respectively. ‘ 


Electronic properties of 3Hal films with surface 
passivation and charge-transport layers 


Several characterization methods were used to . 
determine the film properties of the perovskite 
layer and the effect of the surface treatment, 
with and without CTLs. The following three 
configurations were considered, with either no 
surface treatment or PI or LiF surface treat- 
ment: (i) quartz glass with the perovskite di- 
rectly deposited onto this substrate; (ii) “half 
stack” devices (glass/ITO/2PACz/3Hal), where 
ITO is indium tin oxide and 2PACz is [2-(3,6- 
dimethoxy-9H-carbazol-9-yl)ethyl phosphonic 
acid; and (iii) “full stack” devices (glass/ITO/ 
2PACz/3Hal/Cgo) (2D. 

We performed PL measurements on the three 
configurations mentioned above to obtain PL 
quantum yield (PLQY) and quasi-Fermi level 
splitting (QFLS) values (table S1). The results 
show that the highest QFLS was obtained for 
the LiF samples on quartz (1.33 eV) and in the 
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Fig. 1. Electronic properties of 3Hal perovskite films with surface treat- 
ment and charge-transport layers. (A) Pseudo-JV curves for samples without 
surface modification, with LiF and Pl, both for single films on quartz and for 
complete device stacks. (B to E) Energy-level alignment of the 3Hal-Pl and 
3Hal-Pl-Ceq interface, obtained by means of UPS (hv = 6.5 eV) and CFSYS. The 
energetic positions of the 3Hal perovskite are obtained from the respective 
drawn layer stack—for (D) and (E), the perovskite VBM obtained from below the 


half stack (1.29 eV) configurations, but for the 
full stack, the PI sample shows the highest 
QELS (1.27 eV). From the intensity-dependent 
PL results, pseudo-JV curves were constructed 
(Fig. 1A). For samples on quartz, LiF achieved 
the highest PLQY (9%) of the series, whereas 
both reference and PI samples show similar 
PLQY (~5%). The negligible impact of PI on the 
PLQY and pseudo-JV of perovskite-on-quartz 
samples rules out a chemical passivation ef- 
fect, but interface recombination occurs in the 
full stack with ITO and the CTLs. Warby e¢ al. 
demonstrated that the recombination is mainly 
caused by the presence of Cg (ETL) (27). Al- 
though perovskite-on-quartz samples show 
similar pseudo-JV curves and hence similar 
pseudo-PCE for samples with and without treat- 
ment, in full stack configuration we observed 
a higher pseudo-PCE for PI owing to a higher 
QELS. The QFLS of the full-stack at 1 sun equiv- 
alent illumination are 1.27, 1.23, and 1.18 eV for 
samples with PI treatment, with LiF treat- 
ment, and without treatment, respectively. 
The beneficial effect of the PI occurred only 
with Cg, as ETL, indicating that PI used on 
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3Hal perovskite reduces across-interface recom- 
bination rather than directly passivating the 
perovskite surface. 

We performed near-ultraviolet (UV) photo- 
electron spectroscopy (UPS) measurements 
(hv = 6.5 eV, where h is Planck’s constant and v 
is the photon frequency) as well as with photon 
energies varying from 4 to 7 eV in the constant 
final-state yield mode (CFSYS) (22) to investigate 
the band alignment at the interface of 3Hal- 
Ceo with LiF or PI interface modification and 
Cgo thickness varying from 0 to 3 nm in the 
full stack samples. We applied a detailed model 
of the perovskite valence band maximum (VBM) 
in combination with the work function (WF) to 
precisely determine the ionization energy and 
thus assess possible dipole effects of the inter- 
face modification (figs. S2 and S3). The WF is 
a property of the topmost layer in the inves- 
tigated stack. For 3Hal-LiF, the WF decreased 
by ~-0.17 eV compared with the nontreated 
perovskite, which is consistent with our pre- 
vious study (22). However, the WF of 3Hal-PI 
increased compared with the reference sam- 
ple by 0.14 eV, and hence the PI-treated sur- 


3 nm Ceo is used. AEc* presented in (D) is the conduction band offset taken from 
(34). All values presented in the figure are in electron volts. (F to 1) Valence band 
region and occupied gap state density of 3Hal perovskite with LiF and PI for 
varying Cg thickness [0, 1, 2, and 3 nm for (F), (G), (H) and (I), respectively] 
measured with CFSYS on the stack ITO/2PACz/3Hal/Surf.Treat./Cgo. The energy 
reference zero was obtained by modeling the Fermi edge (E-*) at the sample 
surface observed in the spectra. 


face shows the highest WF of 4.60 eV (Fig. 
1C and table S2) (23). Kelvin Probe (KP) re- 
vealed larger WFs for samples with PI sur- 
face treatment compared with the untreated 
reference with increase of 0.24 and 0.15 eV 
with and without 2PACz, respectively (tables 
S3 and S4). 

Changes in the WF of a semiconductor sur- 
face can be caused by a shift in the Fermi level 
position within the band gap (through doping 
or charge redistribution), by a surface dipole, 
or both effects. We distinguished these effects 
by measuring the ionization energy /*, the 
difference between the perovskite VBM Ey 
and the vacuum level Ey... The formation of a 
dipole changes both WF and /* by the same 
amount, whereas a change in Fp within the 
band gap leaves J* unchanged. We determined 
the Fy and thus calculated /* not only on the 
neat, LiF-treated, or Pl-treated surfaces, but 
also underneath thin layers of Cgo (up to 3 nm), 
thus also accounting for potential charge re- 
distribution occurring at the interface. With- 
out Ceo, J* was 5.86, 5.84, and 6.21 eV for 
samples without an interlayer and with LiF 
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Fig. 2. Properties of 3Hal films and performance of single-junction solar 
cells with and without surface treatment. (A to C) tr-SPV graphs showing the 
possible charge separation mechanisms at the bottom, and (D to F) TRPL 
analysis performed on different stacks: (A) and (D) quartz/3Hal, (B) and (E) half 
stack (ITO/2PACz/perovskite), and (C) and (F) full device, that is, with a Ceo 
layer added, all considering 3Hal with and without LiF or Pl surface treatment. 
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The fluence was ~110 nJ cm”, corresponding to around 6 suns equivalent 
steady-state charge carrier density. For SPV, signals were measured after a 5-ns 
laser excitation with a fluence of 38.3 uJ cm™ and photon energy of 1.8 eV. 
For TRPL, single exponentials were used for fitting the curves. For the half stack 
and full device cases, the shaded areas indicate the data range in which 

mono-exponential fits were conducted to approximate the nonradiative decay 
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time (after the initial decay, before the background noise level). (G and H) Box 
charts presenting statistics of Voc and PCE respectively, of the p-i-n single junctions 
by using 3Hal perovskite with Pl and LiF treatment. (I) Aging test of perovskite 

single-junction devices according to ISOS-LC-1I, prepared by using 3Hal as absorber 


and PI, respectively. Although LiF might slight- 
ly reduce J*, PI increased it by ~350 meV, in- 
dicating the presence of an electrostatic dipole 
at the sample surface, as also observed by 
Li et al. (18) although oriented in the opposite 
direction. Possible reasons for this difference 
include the different chemical composition of 
the perovskite they used (3Cat, low bandgap) 
and the higher amount of unreacted PbI, on 
the perovskite surface detected with x-ray dif- 
fraction, caused by the loss of MA* or FA* 
cations during degradation processes. The var- 
iations were less pronounced with an addi- 
tional Cgo layer: the Cgo thickness-averaged 
I* of the perovskite underneath Cg, LiF-Cgo, 
and PI-Cgo were 5.88, 5.88, and 5.95 eV, re- 
spectively. Thus, whereas J* with and with- 
out LiF are similar within error, for PI, /* 
increased by ~70 meV. 

In Fig. 1, B to E, we combine the photo- 
electron spectroscopy (PES) results for the 
neat 3Hal and the 3Hal-Cgp interface with and 
without PI surface treatment. The sketched 
band bending in the perovskite is based on 
the assumption of a more intrinsic bulk (24), 
which the surface-sensitive PES measurements 
cannot probe. Considering the conduction band 
offset (AEc) as determined for the 3Cat-Cgo 
interface and using the slightly higher band 
gap energy of 3Hal, we obtained AE = -0.31 eV 
(Fig. 1D). The positive dipole induced by the PI 
molecule reduced this offset between the 3Hal 
conduction band minimum and the C¢go low- 
est unoccupied molecular orbital (LUMO) to 
+0.04 eV. We propose that the reduced inter- 
facial losses upon PI incorporation originated 
from changes in WF induced by a positive dipole 
(23), which shifts all energy levels of the Cgo at 
the interface, which in turn decreases the con- 
duction band offset and enables a high Voc. 

We also used CFSYS measurements to char- 
acterize the density of occupied states in the 
perovskite band gap. On samples without ETL 
(Fig. IF, O nm Co), we observed a similar gap 
state density for 3Hal and 3Hal-LiF, where- 
as for 3Hal-PI, it appeared to be slightly in- 
creased, especially near the Fermi edge. This 
finding agrees with PL measurements, which 
showed that the QFLS could only be increased 
with PI in the presence of Cgo, indicating the 
lack of chemical passivation with the PI treat- 
ment. For the samples with Cgg (Fig. 1, G to 
I, 1 to 3 nm) on the perovskite, the gap states 
formed at the interface contributed to the 
CFSYS spectrum because the probing depth 
is between 5 and 10 nm. The sample with LiF 
showed the highest gap state density for all 
three thicknesses (1, 2, and 3 nm), as observed 
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in our previous study (22). For 1 nm Cgo, the 
lowest gap state density was found for the PI- 
modified interface, which is consistent with 
PI reducing the nonradiative recombination. 
Nevertheless, for thicker Cg layers, the refer- 
ence layer stack showed the lowest gap state 
density. With these and the PL data previously 
presented, we attribute the positive influence 
of the PI to the formation of a positive dipole 
on the 3Hal perovskite. 

In solar cells, the extraction of photogener- 
ated electrons and holes occurs simultane- 
ously; hence, it is challenging to separate both 
processes. We measured transient surface photo- 
voltage (trSPV), which is proportional to the 
concentration of separated charge carriers at 
the respective CTLs, to study charge extraction, 
which appears as a fast initial rise in the SPV 
signal, and recombination, which appears as 
the decay of the SPV. Poor selectivity, slow 
carrier extraction, and severe trap-assisted re- 
combination also slow down the initial in- 
crease of the SPV signal and decrease the SPV 
amplitude (25, 26). Thus, we focused on the 
previous configurations—namely, perovskite 
films on quartz glass without CTLs, half stack 
and full stack devices—to disentangle the ef- 
fect of passivation on electron and hole ex- 
traction dynamics and recombination. The 
effect of surface treatment on perovskite films 
on quartz glass without CTLs is shown in Fig. 
2A. The PI-treated structure exhibited a sub- 
stantial negative amplitude, corresponding to 
the flow of electrons to the perovskite surface 
and free holes prevailing in the bulk of the 
perovskite film. The excess electron concen- 
tration near the surface was consistent with 
a ~200-meV-lower perovskite VBM for the 
PI-treated surface found from PES measure- 
ments, suggesting a higher n-type character 
of the surface (Fig. 1, B and C), as also reported 
by Li et al. (18). 

We found that the SPV signal for the PI 
sample depends on its surface coverage and 
that the maximum negative amplitude was 
observed for a 0.3 mg ml’ PI solution (fig. $4). 
The perovskite without any surface treatment 
showed a positive SPV signal revealing hole 
accumulation or trapping near the surface (ETL 
side of the structure). By contrast, the LiF treat- 
ment decreased the positive amplitude, imply- 
ing passivation of the perovskite surface and a 
reduced concentration of trapped or accumu- 
lated holes near the ETL surface. 

To investigate hole extraction, we studied 
the hole-transport layer (HTL)-perovskite in- 
terface in the half stack configuration (Fig. 
2B). The SPV signal shows an almost doubled 


layer with or without PI surface treatment. Devices were MPP-tracked in nitrogen 
at 25°C for 1020 hours under cycled light (12 hours dark, 12 hours illuminated). 
Averages of 11 (Pl) and eight (reference) solar cells are shown. The inset line 
corresponds to the 80% of the maximum PCE of the first cycle (PCEgo). 


amplitude and a steeper slope (increased ex- 
traction rate) for the PI-treated samples com- 
pared with the nontreated perovskite samples, 
which we attributed to reduced recombina- 
tion and improved electron charge selectivity 
at the ETL side (26), as observed for samples 
without any CTL (Fig. 2A). However, the LiF 
samples with HTL showed a slower or delayed 
rise in the SPV signal as compared with that of 
reference and PI samples. Similar responses 
were observed in the full stack configuration 
presented in Fig. 2C, in which free electrons 
and holes are extracted at the same time, into 
their respective transport layers. The PI-treated 
sample demonstrated the fastest rise and | 
largest amplitude, whereas LiF and reference 
samples showed a slower rise and lower am- 
plitude. Delayed charge extraction could be 
caused by a tunneling barrier to electron ex- 
traction introduced by LiF (77), which is avoided 
when using PI. 

Additionally, the time-resolved PL (TRPL) 
analysis in Fig. 2, D to F, showed a fast ex- 
traction process of about 10 ns that is followed 
by long tails (in the order of a few 100 ns toa 
few microseconds) that are governed by non- 
radiative recombination (7). The decay times 
extracted by mono-exponential fits to the tran- 
sient at later times (at ~2 to 5 us) allowed us to 
assess the nonradiative losses. 3Hal perovskite 
on quartz (Fig. 2D) showed a carrier lifetime 
(t) of 2 ws, which is in agreement with mea- 
surements on a 3Hal perovskite layer in our 
recent study (27). Compared with perovskite 
on quartz, a reduced t, was observed when add- 
ing 2PACz to the stack (from 2 to 0.4 us for the 
bare perovskite on quartz and half stack, re- 
spectively). In the half stack (Fig. 2E), the pe- 
rovskite with PI showed a decay time around 
twice as large as with a bare perovskite (~0.7 us), 
which indicated that PI slightly reduced non- 
radiative recombination losses at the perov- 
skite surface. When Cgp was added to the stack 
(Fig. 2F), the recombination losses increased 
because of the high gap state density and the 
energy level offset (0.31 eV) present at the 
perovskite-Cgo interface, further reducing t, from 
~400 to 200 ns, considering the bare perov- 
skite only. However, with PI, the decay time 
increased drastically (t, ~ 4 us) and was about 
five times longer than without Cgo (0.7 us). For 
LiF surface treatment on the full stack device, 
t does not change compared with the bare 
perovskite (both show 7 ~ 0.2 us). Thus, the 
TRPL transients suggest that PI at the con- 
centration of 0.3 mg ml” helps retain the high 
PL of 3Hal when paired with Cgo, as shown by 
the enhanced carrier lifetime. 
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Fig. 3. Performance of 
optimized perovskite- 
silicon tandem solar cells. 
(A) Sketch of the perovskite- 
silicon tandem device stack 
with a silicon (Si) hetero- 
junction bottom cell, trans- 
parent conductive oxide 
(TCO), self-assembled 
monolayer (SAM), triple- 
halide (3Hal) perovskite, 
lithium fluoride (LiF) or 
piperazinium iodide (PI), 
Ceo, tin oxide (SnOz), 
zinc-doped indium oxide 
(1ZO), LiF antireflective 
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measurements. (D) Certified (quasi-) steady-state JV measurement conducted at JRC-ESTI, including the performance parameters and a photograph of the tandem cell. 


Structural properties of 3Hal films with surface 
passivation and charge-transport layers 
Scanning electron microscopy (SEM) top view 
images of the perovskite films are shown in 
fig. S5. The 3Hal grains (200 to 300 nm wide) 
showed similar morphologies with and with- 
out PI, which indicated that PI formed either a 
film that is too thin to be detected or that it did 
not create a uniform layer on the perovskite 
surface. After LiF evaporation, agglomerates, 
likely LiF clusters, formed on the perovskite 
surface rather than forming a uniform layer 
(fig. S5D). In contrast to LiF, which has been 
shown to act as a contact displacer between 
the perovskite and the Cgp layer (17), for PI the 
presence of a layer on the surface is uncertain: 
Energy dispersive x-ray analysis (EDX) and 
x-ray photoelectron spectroscopy (XPS) showed 
a slight increment of the nitrogen (N), carbon 
(C), and hydrogen (H) signals and N-C bond, 
respectively, but this is not sufficient to deter- 
mine the presence of a PI layer and/or PI 
agglomerates (figs. S6 and S7). We performed 
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grazing-incidence wide-angle x-ray scattering 
(GIWAXS) (fig. S8) on perovskite films with 
and without surface treatment (both PI and 
LiF). Although the perovskite (100) pattern 
increases slightly for the PI-treated perovskite, 
the measurement reveals that post-deposition 
treatments did not influence grain growth. 
Therefore, we assume that PI only plays a role 
in improving the band alignment rather than 
acting as contact displacer. 


Single-junction solar cells 


We fabricated single-junction devices using the 
following device stack: Glass/ITO/2PACz (SAM)/ 
3Hal/[surface treatment]/Cg./tin oxide (SnO.)/ 
silver (Ag). Details regarding the device prep- 
aration are presented in the supplementary 
materials. Shown in Fig. 2, G and H, are the 
single-junction Voc and PCE values by using 
3Hal as absorbing material with and without PI 
and LiF (PI concentration optimization is shown 
in fig. $9, with an optimum at 0.3 mg mI’, and 
all device parameters are shown in fig. S10). 


Surface treatment of 3Hal with PI improved 
the Voc (median of 1.26 V) compared with LiF 
(median of 1.22 V). The high Voc values of up 
to 1.28 V (91.5% of the detailed-balance limit) 
obtained by using PI agree with the QFLS (Fig. 
1A and table SI) and results in a Voc deficit 
(Ec/q-Voc) of just 400 mV. This may be ex- 
plained by the modification of the WF (from 
UPS and KP) by a dipole, as discussed above 
and shown in Fig. 1E, fig. S3, and tables S2 
and S3. However, the light soaking effect shows 
almost no Voc decay after ~2 min of light ex- 
posure (fig. S11). 

The operational stability of devices was 
assessed in a light-cycling test (12 hours of 
illumination followed by 12 hours of dark) ac- 
cording to ISOS-LC-1I (28) at 25°C in nitrogen 
for 1020 hours (lamp spectrum is available 
in fig. S12) (29). These measurements can be 
considered more realistic than a constant 
illumination test because day-night light cycles 
also occur under real operation conditions out- 


doors. The light-cycled test is additionally able 
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to account for metastable behavior of PSCs. 
Maximum power point (MPP) tracks (average 
of 11 and 8 solar cells for PI and reference, re- 
spectively) and 80% of the maximum PCE of 
the respective first cycle (PCEgo) are shown in 
Fig. 21. The PI device shows a maximum value 
of 20.98% PCE in the first cycle and 17.84% in 
the last one. Reference devices show a maxi- 
mum value of 17.81% PCE in the first cycle and 
14.58% in the last cycle. Thus, PI devices retain 
85.03% and reference devices retain 81.91% of 
the initial maximum PCE. The evolution of 
the cycled behavior is shown in fig. S13, where 
for each cycle a different time to plateau is ob- 
served and is dependent on the type of device. PI 
devices show a faster rise of PCE for all con- 
sidered cycles. A faster transient behavior is 
desirable because devices reach their max- 
imum efficiency faster, and more energy is 
produced within a light cycle. The slow tran- 
sient behavior in the beginning of a light cycle 
has shown to be caused by the slow migration 
of cations (30). 


Perovskite-silicon tandem solar cells 


A schematic of the tandem configuration stack 
is shown in Fig. 3A, and the detailed pro- 
cedure steps are presented in the supplemen- 
tary materials. 

Tandem devices with PI could confirm higher 
maximum Voc values (1.95 V) compared with 
that of LiF (1.90 V) (fig. S14). A continuously 
MpPP-tracked (in air at 23° to 32°C) tandem cell 
with PI retained 80% of its initial PCE (28.2%) 
after 347 hours and 75.7% after 478 hours 
(fig. S15). 

We implemented several optimizations sep- 
arately and combined all of them to improve 
performance. First, the device fabrication was 
optimized. The self-assembled monolayer (SAM) 
Me-4PACz {[4-(3,6-dimethyl-9 H-carbazol-9-yl) 
butyl]phosphonic acid} was used as HTL for 
the top cell, together with a wetting-improving 
additive (7, 31). In contrast to our previous 
work (7), in this study this HTL mainly improved 
median Voc values from ~1.92 V (2PACz) to 1.94 V 
(Me4PACz) (fig. S16), agreeing with QFLS mea- 
surements (table S1). We also increased the 
photogenerated current density of the silicon 
bottom cell U/s;) by adding a reflector with a 
dielectric buffer layer (RDBL) on the rear of 
the bottom cell, as previously implemented by 
Cruz et al. on silicon single-junction (32) and 
tandem solar cells (8). This set of modifications 
enabled a PCE of 29.9%, as presented in fig. S17 
together with the MPP tracking, which shows 
stable values over 3 min. 

For one tandem cell, a thicker (160 nm) LiF 
antireflective coating (ARC) was evaporated 
(compared with 100 nm for baseline cells) 
and led to a stronger perovskite limitation, 
with photogenerated current densities of 
19.20 and 20.76 mA cm” in the perovskite 
and silicon, respectively. These cells are par- 
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ticularly suitable for stability measurements 
because the strong perovskite limitation will 
immediately affect the performance of the 
tandem cell if the perovskite subcell changes. 
The cell was stored in dark in an N.-filled 
glovebox, and JV and MPP (10 min) were 
regularly measured. After 3000 hours, the cell 
retained 99.6% of its initial efficiency (29.2%) 
(fig. S18). 

We also reduced the front transparent con- 
ductive oxide (TCO) thickness to reduce para- 
sitic absorption and reflection losses, which 
increased short-circuit current density (Jc), 
and added grid fingers to maintain lateral 
conductivity and therefore fill factor (FF). 
We optimized the number of grid fingers and 
zinc-doped indium oxide (IZO) thickness by 
performing optical and electrical simulations 
and then verified the results experimentally. 
Reducing the IZO from 100 to 40 nm while 
adding three grid fingers with a pitch of 
~2.6 mm enabled a relative PCE gain of ~3%. 
More details on the optimization can be found 
in the supplementary materials, materials and 
methods (figs. S19, S20, and S21 and table S5). 
A direct comparison of the external quantum 
efficiency (EQE) and reflection between 100 and 
40 nm is shown in Fig. 3B. 

These optimizations enable median values 
(20 cells) of 20.09 mA cm” 7Jgc, 78.27% FF, 1.97 V 
Voc, and a PCE of 31.1% (Fig. 3C and fig. S22). 
Three-min MPP tracks of all cells are shown 
in fig. S23 and demonstrate stable devices. 

To further improve the optical performance, we 
decreased the thickness of Cgp from 18 to 16 nm 
and SnO, from 20 to 10 nm. For all tandems 
presented in this work, we applied a low sputter- 
damage approach based on a reduced applied 
radio frequency (RF) power density, which reduces 
the kinetic energy of damaging particles on the 
thin Cg and SnO, (33). EQE spectra are shown 
in fig. S24, demonstrating high photogenerated 
current densities of 20.3 and 21.2 mA cm™~ for 
the perovskite and silicon subcell, respectively. 
The improvement in Jgc, although maintaining 
Voc and FF (Fig. 3C), enabled to increase the 
PCE to certified 32.5% + 1.0% (figs. S25 and S26), 
which corresponds to 76% of the detailed-balance 
limit. The (quasi) steady-state measurement per- 
formed at Joint Research Centre-European 
Solar Test Installation (JRC-ESTD is shown in 
Fig. 3D together with a photo of the tandem 
device and the performance parameters. The 
measurement at JRC-ESTI was identical to 
the in-house measurement (fig. S27). 

We performed pseudo-JV measurements with 
subcell-selective intensity-dependent PL and 
injection-dependent electroluminescence (EL) 
(figs. S28, S29, and S30 and table S6). QFLS val- 
ues (Voc values from pJV) of ~0.72 and ~1.27 eV 
were measured for the silicon and perovskite 
subcell, respectively. The sum of 1.99 eV is in 
good agreement with the JV-measured Voc 
(1.98 V). The perovskite pseudo-FF of up to 87.8% 


enables a tandem pseudo-FF of 84.2% and hence 
a pseudo-PCE of 33.8%. The good agreement 
between the EL and PL-based pJV curves dem- 
onstrate well-aligned bands (72). 


Discussion 


We demonstrated the advantage of PI modi- 
fication over the commonly implemented LiF 
interlayer. As revealed with sensitive constant 
final state yield spectroscopy measurements, 
the PI did not influence the density of oc- 
cupied gap states at the interface, whereas 
LiF induced a higher occupied gap-state den- 
sity in the Cgo, which is consistent with our 
previous observations (22). This still allows 
for an improved interface passivation with LiF 
because this interlayer not only induces a more 
asymmetric charge carrier concentration at 
the interface (22), but theoretical consider- 
ations also suggest that a LiF “spacer” re- 
duces wave function overlap (/7) and hence 
the nonradiative across-interface recombi- 
nation through these additional gap states 
in the Cgo (27). Under this specific perovskite 
composition (3Hal) and PI concentration 
(0.3 mg ml”), the main effect of PI is not a 
chemical passivation of the perovskite surface 
but rather allowing for the offset between the 
conduction band (CB) and lowest unoccupied 
molecule orbital to be substantially reduced 
(by ~350 mV). Hence, the quasi-Fermi level 
of electrons can move closer to the perovskite 
CB edge. In addition, from the point of view 
of charge separation, the surface photovolt- 
age measurements show increased electron 
selectivity at the perovskite surface in the 
presence of PI. We are aware of the different 
working mechanism of PI on the perovskite 
surface compared with that proposed by 
Li et al. (18), and we believe that this dif- 
ference is caused by the different perovskite 
composition and the concentration of PI used 
(1 mg ml” in their work, and 0.3 mg mI in 
this work). 

Implementing high-quality perovskite mate- 
rial and using PI as a suitable surface treatment 
strategy allowed us to achieve an exceptionally 
high Voc for single junctions (up to 1.28 V, 
which corresponds to 91.5% of the detailed- 
balance limit) and tandems (up to 2.0 V). The 
tandem cells retained 80% of the initial effi- 
ciency (28.2%) after Tg9 = 347 hours and 75.7% 
after 478 hours of continuous MPP tracking. 
In addition, by combining several optimiza- 
tion processes, we obtained highly efficient 
tandem devices, with a certified PCE of 32.5%. 
Subcell-selective measurements revealed a 
pseudo-PCE of 33.8%. Further improvement 
could be achieved for a double-side textured 
cell, for which the improvement in Jgc is ex- 
pected to be ~0.5 mA cm ® (~2.5% relative 
gain) (6). 

We provide insight on the working mecha- 
nism and consequent improvement of perovskite 
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and silicon-perovskite solar cell performance 
using 3Hal with PI. Although our results show 
a slight improvement of the stability com- 
pared with those of reference devices, we be- 
lieve that further studies may give positive 
perspectives for future commercialization. 
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Translation dynamics in human cells visualized at 
high resolution reveal cancer drug action 


Huaipeng Xing”, Reiya Taniguchi’, Iskander Khusainov’, Jan Philipp Kreysing”°, 


Sonja Welsch‘, Beata Turofiova’, Martin Beck!>* 


Ribosomes catalyze protein synthesis by cycling through various functional states. These states 

have been extensively characterized in vitro, but their distribution in actively translating human cells 
remains elusive. We used a cryo-electron tomography-based approach and resolved ribosome structures 
inside human cells with high resolution. These structures revealed the distribution of functional states 
of the elongation cycle, a Z transfer RNA binding site, and the dynamics of ribosome expansion 
segments. Ribosome structures from cells treated with Homoharringtonine, a drug used against chronic 
myeloid leukemia, revealed how translation dynamics were altered in situ and resolve the small 
molecules within the active site of the ribosome. Thus, structural dynamics and drug effects can be 


assessed at high resolution within human cells. 


he eukaryotic ribosome (80S) consists of 

two subunits (60S and 40S) that trans- 

late mRNA into proteins (J). Purified 80S 

ribosomes have been extensively studied 

in vitro, which has elucidated molecular 
details of translation (2, 3). The translation 
process can be divided into four main stages: 
initiation, elongation, termination, and ribo- 
some recycling (3). During elongation, the 
rotation of the 40S, the association of elonga- 
tion factors, and the translocation of tRNAs 
are coordinated to synthesize nascent chains 
(3). tRNAs have three canonical binding sites 
on the ribosome: aminoacyl (A), peptidyl (P), 
and exit (E) sites (3). A noncanonical tRNA 
binding site called the Z site, located in an 
extreme position past the E site, has been 
identified in structures of isolated transla- 
tionally inactive mammalian ribosomes (4). 
It is speculated that the Z site may represent 
a late-stage intermediate of tRNA ejection 
downstream of the E site with similarity to 
internal ribosome entry site interactions, but 
the exact physiological relevance remains ob- 
scure (4). 

The 70S ribosome from Mycoplasma pneu- 
moniae and 80S ribosome from Dictyostelium 
discoideum and the respective translation 
elongation processes have been visualized 
inside cells using cryo-electron tomography 
(cryo-ET), revealing differences in translation 
elongation (5-7). For example, the most abun- 
dant elongation intermediate in bacterial 
M. pneumoniae cells was the “A, P” state 
wherein tRNAs were identified in the respec- 


Department of Molecular Sociology, Max Planck Institute of 
Biophysics, 60438 Frankfurt am Main, Germany. “Faculty of 
Biochemistry, Chemistry and Pharmacy, Goethe University 
Frankfurt am Main, 60438 Frankfurt am Main, Germany. 
SIMPRS on Cellular Biophysics, 60438 Frankfurt am Main, 
Germany. “Central Electron Microscopy Facility, Max Planck 
Institute of Biophysics, 60438 Frankfurt am Main, Germany. 
‘Institute of Biochemistry, Goethe University Frankfurt, 
Frankfurt am Main, Germany. 

*Corresponding author. Email: martin.beck@biophys.mpg.de 


Xing et al., Science 381, 70-75 (2023) 7 July 2023 


tive positions. By contrast, the elongation 
factor bound “eEF1A, A/T, P” state was most 
abundant in eukaryotic D. discoideum cells. 
In microsomes isolated from human cells, 
the membrane-associated elongation cycle has 
been resolved, revealing the eEFIA, A/T, P, E 
state as the most prominent (8). However, to 
which extent these insights are applicable to 
actively translating human cells remains unclear. 


mRNA 


t. 


Homoharringtonine (HHT) is a nat Chee 
compound that binds to the ribosome ant. i 
hibits protein synthesis (9). It is used to treat 
patients with chronic myeloid leukemia clini- 
cally and as a reference to study new anticancer 
ribosome inhibitors (9, 10). The HHT-bound 
ribosome structure has been determined by 
incubating the purified 80S ribosome with HHT 
in vitro (11, 12), revealing the binding site at 
the peptidyl transferase center (PTC). In the 
cellular context, it remains unclear how HHT 
affects translation dynamics and how exactly 
it inhibits protein synthesis. We applied cryo- 
focused ion beam (cryo-FIB), cryo-ET and 
advanced data processing algorithms to de- 
termine the near-atomic structures of ribosomes 
and analyzed the abundance and organization 
of different ribosome states in untreated and 
HHT-treated human cells. 


Ribosomes are bound with HHT inside 
human cells 


To study the 80S ribosome structures inside 
human cells, we first prepared cryo-FIB-milled 
lamellae from 35 native (untreated) human 
embryonic kidney cells (HEK-293) and acquired 
358 tilt series (fig. SLA and table SI). We used 
template matching to identify ribosomes within ‘ 
the reconstructed tomograms (fig. SIB and 


Codon sampling 
Codon recognition ‘ 


18S rRNA 


Fig. 1. 80S ribosome structures in human cells. (A) Color-coded local resolution map of the eEF1A, A/T, 

P state identified in the untreated dataset. (B) Superposition of the codon recognition state (PDB 5LZS, cyan) 
onto our eEFIA-tRNA ternary complex at the eEF1A, A/T, P state. The atomic model of tRNA (lavender) and 
eEF1A (maroon) from PDB 4CXG were fitted into our map for comparison to 5LZS. (€) The mRNA segmented 
from the eEF1A, A/T, P state was fitted with PDB 5LZS. Density is more clearly defined at the P site as 
compared with the A site. (D) The atomic model (5LZS) of 18S rRNA (A1822 to A1827) was rigid-body-fitted 
into the corresponding density of the 80S at the eEF1A, A/T, P state. The decoding nucleotides A1824 and 
A1825 in the atomic model are in the flipped-out configuration, which is inconsistent with the observed 


density (arrowhead). 
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movie S1). After classification and refinement, 
we determined the ribosome structure at an 
overall resolution of ~3.2 A and locally resolved 
features up to 2.5 A in resolution from un- 
treated cells (Fig. IB and figs. S1 to S3). The den- 
sity of ribosomal protein side chains, ribosomal 
RNA (rRNA) bases, tRNA bases, and ions were 
well-resolved (fig. SIE), indicating the quality of 
the map. Using the same approach we processed 
352 tilt series from 32 HHT-treated cells. In the 
resulting structure the density of HHT was visi- 
ble (fig. S1, F and G) and the position of HHT at 
the PTC showed a high similarity to previous 
in vitro analysis (9, 11). The P-tRNA was poorly 
resolved compared with the untreated ribosome 
(fig. SIG), suggesting that the translation was 
altered after HHT treatment. 


Translation elongation cycle in human cells 


We next investigated the translation elonga- 
tion process. In untreated cells, focused classi- 
fication was performed with dedicated tRNA 
and elongation factor masks, resulting in eight 
ribosome states resolved from 3.4 A to 16.4 A 
(figs. S2 and S3 and table S2). These states 
were well-explained by previously reported 
structural models that captured various elon- 
gation states (figs. S4 to S6) (4, 8, 13-15). The 
eEF1A, A/T, P state was the best resolved, 
globally reaching ~3.4 A (Fig. 1A), and dis- 
played structural features characteristic for 
tRNA scanning, indicating that codon sampling 


eEF1A-tRNA 


40S N_ _— 
~p 
4.5% 

eEF1A-tRNA 


eEF1A, A/T, P,E 
13.0% 


ai? 


rather than codon recognition is occurring 
(Fig. 1B and fig. S4A) (73, 16). Specifically, the 
mRNA codon was more clearly resolved at 
the P site (Fig. 1C), suggesting that the codon- 
anticodon interaction at the A site is not yet 
established. Consistently, we observe the decod- 
ing nucleotides A1824 and A1825, which are 
thought to stabilize the A-site tRNA by a 
“flipping out” movement upon codon recogni- 
tion (13), in a “flipped in” conformation (Fig. 1D). 

In total, six classes—accounting for ~83% 
of all identified ribosomes—were assigned to 
the translation elongation cycle based on their 
similarity to the previously reported structures 
(Fig. 2A) (3, 5, 13, 16). The elongation cycle is 
thought to start from the non-rotated P state 
that was detected but not particularly abun- 
dant in our data (5) (Fig. 2A). It is followed by 
the eEF1A, A/T, P state discussed above which 
was the most prominent, consistent with 
in situ analysis of the lower eukaryote Dictyo- 
stelium discoideum (7), but in contrast to pre- 
vious analysis in bacteria (5). The abundance 
of the codon sampling state in human cells 
could be important for the higher decoding 
fidelity during translation (17). The following 
states were the non-rotated A/T, P and A, P 
states. At a low contour level of the map a factor- 
like density at the eEF1A binding region is seen 
in the A/T, P state, but neither an extended nor 
compact eEF1A model provides a sufficient 
explanation (fig. S&C, Materials and Methods) 


eEF1A, A/T, P 


39.6% 


R20 EEE 
eEF2, ap/P, pelE AS ee 
5.2% oo ie 2 


R1+R2.> 
‘A,P 
14.1% 


R1: Rolled R2: Rotated 


Fig. 2. Ribosome states in native untreated and HHT-treated cells. (A) Six 
ribosome states are assigned to the elongation cycle in untreated cells. (B) The 
eEF2 and eEF1A, A/T, P, Z states in untreated cells. (C) Three potential 
translation elongation intermediate states in the HHT-treated cells. The dashed 
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N: Non-rotated and unrolled 


<e 


eEF1A, AIT, P,Z 


(8, 13). As expected, the A, P state displayed the 
A1824 and A1825 nucleotides in a flipped out 
conformation (fig. S4D). Yet it was less abun- 
dant than in bacteria, where it was the most 
populated (5). The subsequent state was iden- 
tified as eEF2, ap/P, pe/E (18). Next, two 
possible sequences of events are conceivable: 
eEF2 and E-tRNA could first dissociate from 
the ribosome forming the P state that subse- 
quently binds the eEF1A-tRNA, or the eEFIA- 
tRNA may displace eEF2 with subsequent 
departure of the E-tRNA (5) (Fig. 2A). Various 
conformational changes of the small subunit 
are apparent during the elongation cycle (fig. 
85) and consistent with previous work (6, 8). 
Two further states were identified whose 
places in the elongation cycle are not obvious 
(Fig. 2B). The rotated eEF2 state without tRNA 
fitted the previous structural model very well 
(15, 19) (fig. S4G), although the eEF2 position 
was shifted by ~1.7 nm compared with the | 
eEF2, ap/P, pe/E state (fig. S41). The remain- 
ing class, which accounts for only 1.6% of all 
classified ribosomes (Fig. 2B), was consistent 
with the non-rotated eEF1A, A/T, P, Z state, 
which has been suggested to be transient (4). 
Although the resolution of this state was only 


moderate in untreated cells, the observed elec- ‘ 


tron optical density at the Z site fitted well with 
the purified ribosome at the non-rotated P, Z 
state (Fig. 2B and fig. S4, H and J) (4). To our 


knowledge, alternative factors binding to this * 


[___] Untreated 
{___] HHT-treated 


1.6% 


ay Compact eEF2 


eEF1A, AIT, P,Z 


6.7% 


arrow illustrates how the elongation states may connect. (D) The eEF2, 
eEFIA, A/T, P, Z and A/T, P, Z states inside the treated cells. R1, rolled; R2, 
rotated; N, nonrotated and unrolled. The information about 40S rolling and 
rotation is shown in figs. S5 and S11. 
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site have not been identified. More importantly, 
we obtained higher resolution in HHT-treated 
cells (see below), which further strengthened 
this conclusion. We note that ~12% of the 
particles identified as ribosomes were not at- 
tributed to a specific state by our classification 
and may correspond to additional low abun- 
dant states or transitions between states. Col- 
lectively, our data allowed the detailed analysis 
of the translation dynamics inside native hu- 
man cells, which exhibited differences compared 
with previous ex vivo studies (8, 78), indicating 
that the ribosome isolation process may affect 
the ability to capture native elongation inter- 
mediates (5, 7). 


HHT alters the elongation cycle 


The anticancer drug HHT blocks peptide 
bond formation (9, 20). Previous in vitro analy- 
sis of purified ribosomes revealed that HHT 
binds in the A-site cleft in the PTC (//, 27). 
Upon HHT treatment, one would thus expect 
to observe a strong enrichment of ribosomes 
with the A- and P-site tRNAs bound (/1). 
Indeed, M. pneumoniae ribosomes treated 
with 0.5 mg/ml (1547.4 uM) chloramphenicol 
(Cm), a related antibiotic binding to the PTC of 
the 50S, showed over 70% of ribosomes at the 
A, P state (5, 22). The effect of HHT on human 
ribosome states in cells remains unknown. To 
address this, we exposed cells to HHT concen- 
trations of 0.055 mg/ml (100 uM) in the medium, 
which resulted in a significantly lower cellu- 
lar protein concentration than untreated cells 
(fig. S7A), implying that protein synthesis was 
inhibited. At the time point examined by cryo-EM, 
the ATP levels, which serve as an indicator of 
cell viability, were not yet reduced (fig. S7B) 
and the cell morphology of untreated and 
treated cells was indistinguishable (fig. S7C). 


Hluntreated 
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Classification, as described above, identified 
six ribosome states resolved into the 3.7 A to 
11.5 A range (figs. S8 to S12 and table $3). 
Unexpectedly, the rotated state with eEF2 but 
without tRNA accounted for almost half of all 
ribosomes in the dataset (Fig. 2, C and D, 
and Fig. 3A). This state showed density for 
SERPINE1 mRNA-binding protein 1 (SERBP1) 
and thus accounts for hibernating ribosomes 
(15, 23) (fig. S1OG), in which the HHT density 
was visible (fig. S8C). The P state increased in 
abundance compared with the untreated cells, 
whereby the A, P states showed a similar abun- 
dance (Fig. 2C). The HHT molecule was also 
resolved at the A, P state but not in other less 
populated classes as a result of the lower 
resolution (Fig. 3B). We therefore combined 
the particles from the four remaining less abun- 
dant classes. The resulting average also displayed 
density for HHT (fig. S8C). The positions of 
A-tRNA and P-tRNA at A, P states were simi- 
lar in untreated and HHT-treated cells (Fig. 3B) 
(9). One of our classes appeared similar to a 
potential compact eEF2, A, P state (figs. S10, D, 
I, and J, and S11F) (24), although the respective 
local resolution prevents a definitive assign- 
ment due to a lack of secondary structure. 
Finally, two states with Z-tRNA were consider- 
ably more abundant compared with untreated 
cells (Fig. 2D and fig. S10, C and E), showing 
the typical features of tRNA shape at the Z 
site (fig. S10, E and H, and movie S3). Thus, 
HHT treatment results in the accumula- 
tion of ribosome hibernation instead of the 
A, P state, which may be representative of 
the mechanism of the drug action in cancer 
therapy. 

To investigate cell-to-cell variability, we ana- 
lyzed the distribution of the ribosome states 
across individual cells captured by multiple 


Fig. 3. Distribution of ribosome states in cells. (A) Percentages of ribosome states in untreated and 
HHT-treated cells. (B) The PTC of ribosomes at the A, P state in untreated and treated cells is shown 
fitted with PDBs 5AJO and 6QZP, respectively. Structural overlay of tRNAs at A, P states with models determined 
from previous studies (bottom; materials and methods). The black arrow points to P-tRNA. HHT is colored in cyan. 
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tomograms. We identified some degree of cell- 
to-cell variability (fig. SI3A), for example, the 
eEFIA, A/T, P state varied from 25 to 56% in 
untreated cells (fig. S13B). Overall, the signal 
observed in multiple tomograms of the same 
cell was similar, with some notable exceptions 
that may imply local variability (fig. S13B). 
Notably, the abundance of the eEFIA, A/T, P 
(class 1) and eEF1A, A/T, P, E (class 4) was 
largely anticorrelated in untreated cells (fig. 
S13, C and D), whereas the sum of eEF1A-bound 
states was relatively consistent, suggesting that 
its total concentration may be limited (fig. 
S13E). By contrast, the abundance of ribosomes 
containing eEF2 was much more diverse be- 
tween different cells (fig. S13E). 


Polysomes are impaired upon HHT treatment 


We analyzed the spatial organization of dif- 
ferent ribosome states within polysomes in 
treated and untreated cells. The densities ac- 
counting for neighboring ribosomes that are 
commonly observed in the subtomogram aver- 
ages of untreated cells were reduced in HHT- 
treated cells (fig. S14A). Furthermore, the 
ribosome distribution was more dispersed sub- 
sequent to treatment (Fig. 4, A and B), both 
implying that polysomes might be largely abol- 
ished. To test this hypothesis, we defined poly- 
somes based on a distance threshold of 9 nm 
from the mRNA exit to entry sites of neighbor- 
ing particles (fig. S14B and table $4, Materials 
and Methods). Although this arbitrary classifier 
will not capture the entirety of polyribosomes 
as it could, for example, miss more distantly 
spaced ribosomes on a longer transcript, over- 
all it performs well in their detection (fig. S15). 
Of all ribosomes in untreated cells, 30.2% were 
grouped into arbitrary polysomes (Fig. 4C and 
fig. S14, C and D), which was considerably higher 
than in HHT-treated cells (fig. S15, A to E). In 
monosomes the eEF2 state was prevalent (Fig. 
4D and fig. S16, A and B) and exhibits much less 
neighboring density (fig. $16, C and D), under- 
lining the notion that these ribosomes are hi- 
bernating. In polysomes, the eEF2, ap/P, pe/E 
state (class 3, see Fig. 3A) was less frequent in 
the first leading ribosome than the trailing ribo- 
somes (fig. SI5F). Classification of all untreated 
ribosomes with a dedicated mask resolved the 
low-resolution structure of the di-ribosome, in 
which the mRNA density was visible (fig. S17A) 
(25, 26). This di-ribosome resembled the top- 
to-top configuration (t-t, the central protuber- 
ance of both ribosomes facing a similar direction) 
(27, 28) (fig. S17B). Two other arrangements 
of pairs were apparent: one with the central 
protuberance of the i+1 ribosome toward down 
(t-d), the other toward up (t-u) (fig. S17, C and D). 
Although the abundance of these configurations 
declined in the treated cells (fig. S17, D and E), 
the center-to-center and exit-to-entry distance 
of these pairs were indistinguishable from un- 
treated cells (fig. S17F). 
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Fig. 4. Spatial and functional analysis of polysomes. (A and B) Distribution of ribosome states in a representative tomogram from an untreated (A) and 
treated (B) cell. (€) A polysome and the putative mRNA path. The center-to-center distance of the neighboring ribosomes: 27.3 + 1.8 nm (mean + SD, n = 8). 
(D) Distribution of ribosome states in monosomes and polysomes from 358 tomograms of untreated cells and 352 tomograms of treated cells. Class numbers 


are the same as in Fig. 3A. 


We classified ribosomes with a mask cover- 
ing the region beyond the exit tunnel, thus 
capturing both the potential membrane and 
the expansion segment ES27L (figs. S18 and 
$19, and tables S2 and S3). The analysis re- 
vealed that the translation state distributions 
of soluble and membrane-bound ribosomes 
were similar (fig. S1I8D). Notably, seven con- 
formations of the expansion segment ES27L 
were found in cytosolic ribosomes (fig. S19, 
Materials and Methods). One of these con- 
tains a long stretch of ES27L that associates 
with the ErbB3 receptor-binding protein (Ebp1) 
on the surface of the 60S and resembles a 
previously published in vitro structure (29, 30) 
(fig. S19, A and B, and tables S2 and S3). The 
percentage of Ebpl-associated ribosomes was 
below 20% in most untreated cells, whereas 
the abundance was over 25% in all HHT-treated 
cells (fig. S20A). Whether Ebp1 is related to spe- 
cific translation states remained unclear (29, 30). 
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Thus, a subfraction of Ebp1-bound ribosomes is 
observed in all apparent intermediates of the 
elongation cycle (fig. S20B). 


HHT binds the free 60S associated with elF6 


Finally, we investigated the free 60S and 40S 
in the cytoplasm of human cells. The free 60S 
showed Ebp1 and ES27L density in both data- 
sets (Fig. 5, A and B, fig. S21, A and B), which 
was similar to the Ebpl-associated 80S ribo- 
some. However, the 60S was much more abun- 
dant in treated cells than in untreated cells 
(Fig. 5C). Eukaryotic initiation factor 6 (eIF6) 
binds the 60S to prevent premature 60S bind- 
ing with 40S (3/1, 32). Our data show that the 
60S did not contain eIF6 in the cytosol of 
native untreated cells, contrasting HHT-treated 
cells (Fig. 5D). Thus, it appears that eIF6 
prevents the association between large and 
small ribosomal subunits in cells (33-35) 
(Fig. 5, C and D, and fig. $21, C to E). Furthermore, 


the HHT density was resolved in the 60S from 
treated cells (Fig. 5E). Thus, we conclude that 
HHT can bind not only the PTC of different 
states of the 80S ribosome (Fig. 3B and fig. 
S8C) but also the free 60S decorated with eIF6, 
which may block the assembly of 40S and 
60S in the cytosol of the cell. 


Discussion 


Our extensive cryo-ET analysis stresses the 
feasibility of obtaining structures at high res- 
olution and visualizes an anticancer drug 
inside human cells. The local resolution in the 
ribosome core in our study is limited by the 
pixel size. This illustrates that technical obsta- 
cles that hinder high resolution in cellular to- 
mography have been overcome. The combined 
improvements in energy filtering (36), image 
processing (Warp-RELION-M pipeline) (22) 
and tight control over specimen thickness 
(37, 38) enable high resolution, as long as the 
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Fig. 5. Structure of 60S in human cells. (A and B) Structures of free 60S in 
the cytoplasm of untreated cells (A) and HHT-treated cells (B). ES27L and Ebpl 
(PDB: 6SXO) are fitted into the 60S maps. Although Ebp1 was not confidently 
assigned to the 60S from untreated cells as a result of the lower resolution, it 
fitted better in comparison to alternative factors (amino-terminal acetyltrans- 
ferases and nascent polypeptide-associated complex) binding to the tunnel exit. 
(C) Percentage of the 60S in untreated and HHT-treated cells normalized to the 


number of particles that can be obtained is 
sufficient. Further improvements in the autom- 
atization of specimen preparation techniques 
may thus be instrumental in pushing the reso- 
lution for other macromolecular assemblies 
analyzed inside cells (38-43). 

The detailed comparison of ribosome struc- 
tures within untreated and HHT-treated cells 
revealed that HHT binds to the PTC of the 80S 
ribosome in situ (Fig. 3B and figs. S1G and 
S8C), where it can block peptide bond forma- 
tion as suggested by previous in vitro studies 
(9, 11, 21). However, we also observed SERBP1 
binding to ribosomes, a factor that functions 
downstream of mammalian target of rapamycin 
complex 1 (mTORCI1) and leads to a dormant 
ribosome state (44). Such a dormant ribosome 
state can be a result of cellular stress signaling 
(44). Further, we uncovered that HHT bound 
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the free 60S associated with eIF6 (Fig. 5, D and 
E), thus likely hampering the assembly of func- 
tional 80S ribosomes and in turn further impair- 
ing overall protein synthesis. The accumulation 
of 60S-eIF6 may be explained by the fact that 
the HHT binding site overlaps with that of 
the N terminus of Shwachman-Bodian-Diamond 
syndrome protein (SBDS) (fig. S21F) and that 
the binding of SBDS to 60S is necessary to 
release elF6 (34, 45). 

Our study provides insights into the trans- 
lation elongation cycle, the coordination of 
ribosome activity within polysomes, and the 
diverse arrangements of ES27L inside human 
cells. The same technology may be used in the 
future to investigate ribosome, translation, 
and mRNA quality control pathways in the 
context of human tissue culture cells and 
primary patient-derived cells. Intermediates 


15.5 


60S abundance (%) 


number of 80S ribosomes in the respective dataset. In untreated cells, 
abundance = 60S/(60S+80S) = 1,693/(1693+39,402). In treated cells, 60S/ 
(60S+80S) = 7176/(7176+39,070). (D) The structure of 60S in untreated cells 
fitted with PDB 6LSR indicates that elF6 is missing (left), contrasting HHT- 
treated 60S (right). Large subunit GTPase 1 (LSG1), NMD3 and ZNF622 (PDB: 
6LSR) are not observed in the HHT-treated 60S structure. (E) The PTC of free 
60S from untreated and treated datasets fitted with PDB 6QZP. HHT is colored in cyan. 


that have not yet been observed inside cells 
may be enriched by introducing kinetical 
bottlenecks through genetic or pharmaceu- 
tical perturbation (5, 46, 47). Our study may 
set the stage for the analysis of structures 
inside mammalian cells and allow characteri- 
zation of drug susceptibility of human indi- 
viduals at high resolution by cryo-ET. 
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Skeletal ring enlargement is gaining renewed interest in synthetic chemistry and has recently 

focused on insertion of one or two atoms. Strategies for heterocyclic expansion through small-ring 
insertion remain elusive, although they would lead to the efficient formation of bicyclic products. Here, 

we report a photoinduced dearomative ring enlargement of thiophenes by insertion of bicyclo[1.1.0]butanes 
to produce eight-membered bicyclic rings under mild conditions. The synthetic value, broad functional-group 
compatibility, and excellent chemo- and regioselectivity were demonstrated by scope evaluation and 
product derivatization. Experimental and computational studies point toward a photoredox-induced 


radical pathway. 


yclic organic compounds, especially het- 
erocycles, play a key role in pharmaceu- 

tical and natural products chemistry, as 

well as materials science (7-3). Ring ar- 
chitecture not only influences properties 

such as lipophilicity, three dimensionality, 
and scaffold rigidity but also determines 
molecular function and reactivity (J-3). Hence, 
the development of methods to precisely form 
and manipulate ring systems is of utmost im- 
portance in synthetic chemistry. Skeletal ring 
enlargement can convert common feedstocks 
into molecularly complex scaffolds and has 
the potential to induce important changes in 
the chemical or biological properties of the 
original compounds (4-6). Recently, the in- 
sertion of one or two atoms into cyclic mol- 
ecules for skeletal editing, an approach first 
developed in 1881 (7), has received renewed 
attention (8-16). Notable contributions from 
several groups have been reported in the con- 
text of skeletal editing by means of boron (JO), 
nitrogen (1, 12), or carbon (13-15) atom in- 
sertion (Fig. 1A). Two-step approaches for 
dearomative ring enlargement of polycyclic 
arenes through insertion of oxygen (77) and 
carbon (18) have also been demonstrated. 
Moreover, catalytic strategies for two-atom 
insertions into highly strained cyclic carbon- 
yls have emerged as a useful tool for the syn- 
thesis of bridged and fused rings (19) (Fig. 
1A). Extending this concept to the insertion of 
small rings into a heterocyclic parent ring 
(‘ring-in-a-heterocycle”) would allow the facile 
construction of bicyclic frameworks (Fig. 1B). 
The hetero(bi)cyclic moiety is the basic scaf- 
fold of numerous important biologically ac- 
tive natural products (3, 20, 21). Insertion of 


1Organisch-Chemisches Institut, Westfalische Wilhelms- 
Universitat Mtnster (WWU), 48149 Minster, Germany. 
Department of Chemistry and Biochemistry, University of 
California, Los Angeles, CA 90095, USA. 

*Corresponding author. Email: glorius@uni-muenster.de (F.G.); 
houk@chem.ucla.edu (K.N.H.) 


tThese authors contributed equally to this work. 


Wang et al., Science 381, 75-81 (2023) 7 July 2023 


C(sp?)-rich rings into heterocycles to form 
bicyclic compounds is appealing because sp” 
richness, three dimensionality, and increased 
conformational rigidity often facilitate the suc- 
cessful design of new drugs (7). From a synthetic 
perspective, bicyclo[1.1.0 Jbutanes (BCBs) are ideal 
sources of C(sp®)-rich rings because of their 
rigid and defined three-dimensional nature 
(22-27). The peripheral cyclization of unsaturated 
m-bonds with BCBs provides an approach to 
introduce bicyclic scaffolds in synthetic chem- 
istry (28, 29). Very recently, the cycloaddition 
of BCBs to strained cyclopropanes, leading to 
the formation of bicyclo[3.1.1Jheptanes, was 
reported (30, 31). The insertion of BCBs into 
heterocycles, which exist widely in nature, to 
forge heterobicyclic compounds has been se- 
verely curbed owing to the synthetic challenges 
such as activation sequence and chemo- and 
regioselectivity. Considering the applications of 
thiophenes in medicinal chemistry (32, 33), we 
envisioned that skeletal ring enlargement of 
such compounds, by means of radical insertion 
of BCBs, would provide a privileged platform for 
the construction of sulfur-based bicyclic scaf- 
folds and be a blueprint for additional efficient 
ring-in-a-heterocycle enlargement reactions 
(Fig. 1B). 

This abovementioned heterocycle enlarge- 
ment hypothesis would potentially operate 
by a cleavage-rebound sequence. A serious 
synthetic challenge, namely cleaving hetero- 
cycles while avoiding the ring opening of 
the four-membered carbocycles, requires this 
transformation to be carried out selectively 
and under mild conditions (Fig. 1B). To ad- 
dress this problem, a radical strategy trig- 
gered by photocatalysis can be considered, 
owing to its mildness and controllable char- 
acter (34-39). Because of the electron-rich 
character and high triplet energy of thio- 
phenes, any activation strategy making use 
of photoinduced energy transfer was deemed 
less feasible (40-43). Given that the excited- 
state acridinium salt {[Acr-Mes,]*[BF,] }* has 


a high oxidation potential (44-46), it r 


0 


Chec 


be used as a photoredox catalyst for the en 
cise activation of the thiophene rather than 
the BCB to form highly reactive radical cation 
intermediate I, which would attack the BCB 
and form another intermediate II or III. 
Then, eight-membered ring IV would be gen- 
erated through a ring opening-insertion 
process, followed by the electron-transfer re- 
duction to obtain the target molecule (Fig. 1C). 
Herein, a photochemical skeletal ring enlarge- 
ment of thiophenes toward an array of sulfur- 
based bicyclic rings using BCB as an insertion 
unit is presented. The synthetic value, broad 
functional-group compatibility, and outstand- 
ing chemo- and regioselectivity of this protocol 
were demonstrated by scope evaluation, facile 
millimolar-scale synthesis and synthetic de- 
rivatization (Fig. 1D). 


Reaction optimization 


We began our investigation with the use of : 


3-phenylthiophene (1a) and bicyclo[1.1.0]butan- 
1-yl(morpholino)methanone (2a) as reaction 
partners (see supplementary materials for de- 
tails; fig. $2). After optimization, product 3a 
was obtained in 80% yield after reaction under 
argon atmosphere for 12 hours with acridinium 
salt as a photocatalyst and acetonitrile (MeCN) as 
the solvent, and good regioselectivity [8a:3a! = 
94:6 regioselectivity ratio (r-r.)] was observed 
(entry 1, fig. S2). We also evaluated the effect of 
other reaction parameters on this skeletal ring 
enlargement. Varying the concentration of la 
resulted in fluctuations in the yields but little 
effect on regioselectivity (entries 2 to 4, fig. S2). 
Moreover, changing the concentration of reac- 
tion had no effect on the yield (entries 5 and 6, 
fig. S2). Additionally, other solvents, such as 1,2- 
dichloroethane (DCE), acetone, tetrahydrofuran 
(THF), N,N-dimethylformamide (DMF), and 
benzotrifluoride (PhCF3) led to a substantial 
decrease in the reaction yield (entries 9 to 13, 
fig. S2). We then investigated several photo- 
catalysts and found that only acridinium salt 
could drive this chemical conversion (entries 
14 to 18, fig. $2). Irradiation with visible light 
and the presence of photocatalyst were essen- 
tial in this reaction (entries 20 and 21, fig. S2). 
Under the abovementioned conditions, 3a was 
obtained as the predominant regioisomer over 
3a! (fig. S2). By applying a condition-based sensi- 
tivity screening approach (47) (Fig. 2), we found 
that the reaction toward compound 3a is sen- 
sitive to high oxygen concentration and water, 
while being generally tolerant toward higher 
temperature and varying light intensities (see 
fig. S6 for details). 


Substrate scope investigation 


With the optimal conditions in hand, substrate 
scope with respect to the BCBs was first eval- 
uated. As shown in Fig. 2, a series of cyclic 
amide-substituted BCBs proved compatible 
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Fig. 1. Background and concept. (A) Skeletal ring enlargement via one- or two-atom insertion. (B) Conceptual outline for the synthesis of heterocycle-based bicyclic 
compounds. (C) Mechanistic hypothesis for the electron transfer—induced insertion of BCBs into thiophenes. (D) Photochemical skeletal ring enlargement of 
thiophenes through insertion of BCBs (this work). cat., catalytic. 


with this transformation, affording the cor- 
responding products in good yields and regio- 
selectivities (3a to 3e). The reaction of la with 
2a was successfully performed at millimolar 
scale, which produced product 3a in 84% 
yield. Moreover, other amide groups proved 
to be compatible, as shown by products 3f to 
3k. An ester-substituted BCB reacted smooth- 
ly with la to afford product 31. Additionally, 
BCBs containing ketone and ether groups gave 
products in a synthetically useful yield (8m 
and 3n). Encouraged by the above results, 
we sought to examine the substrate scope 
with regard to thiophene derivatives (Fig. 2). 
High regioselectivities and good yields were 
observed with monosubstituted thiophenes 
as reaction partners (30 to 3s). A series of 
functional groups on the aromatic ring, such 
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as deuterium (30), ketone (3p), CF; (3q), fluoride 
(8r), and chloride (3s), proved compatible with 
this protocol, providing many opportunities 
for further diversifications. These examples 
show preferential insertion proximal to the 
aromatic substituent. Further functional-group 
compatibility of this method was demon- 
strated with 3,4-disubstituted thiophenes as 
substrates (3t to 3w). Reactants with alde- 
hyde and silicon-containing ancillary groups 
were converted to the products 3v and 3w. 2,4- 
Disubstituted thiophenes were subsequently eval- 
uated (3x to 3ac), and higher regioselectivities 
were obtained than with the 3,4-disubstituted 
thiophenes, perhaps owing to steric hindrance. 
Compounds with pyridine substituents were 
also compatible with the reaction system (3x 
and 3y). Thiophenes bearing an alcohol or long- 


chain alkyl group substitution at C2 could be 
transformed smoothly to the corresponding 
products through skeletal expansion (Sab and 
8ac). This conversion was successfully extended 
to trisubstituted thiophene, generating com- 
pound 3ad in good yield and superb regiose- 
lectivity. Notably, methyl-substituted thiophene 
was compatible as well (Sae). Unsubstituted 
thiophene reacted in moderate yields to fur- 
nish the corresponding products 3af and 3ag. 
Unfortunately, no desired product was detected 
with the use of 2-methyl-3,5-diphenylthiophene 
as substrate, which might be due to the steric 
hindrance caused by the double substitution 
at the 2,5-position. 

Subsequently, the substrate scope of benzo- 
thiophene derivatives was explored, and in- 
verted regioselectivities were observed relative 
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(X-ray) 


Fig. 2. Substrate scope for direct insertion of BCBs into thiophene derivatives. Yields are given for the isolated major products. Regioselectivity ratio is determined by 
proton nuclear magnetic resonance ('H NMR) analysis of the crude product mixtures. Reaction conditions: thiophene (1, 0.36 mmol, 3.0 equiv), BCB (2, 0.12 mmol, 

1.0 equiv), [Acr-Mes»]"[BF4] (8 mol %), argon, blue light-emitting diodes (LEDs; 450 nm) in MeCN (2 ml) for 12 hours at room temperature; *1 (4.0 equiv); +24 hours; 
t[Acr-Mes2]"[BF4] (16 mol %), 1 (4.0 equiv) for 24 hours. c, concentration; T, temperature; /, intensity; Ph, phenyl; Me, methyl; NBoc, N-tert-butyloxycarbonyl. 


to thiophenes (Fig. 3). The target product 3ah 
was obtained as a single regioisomer in 59% 
yield through ring-opening insertion. The ro- 
bustness of this protocol was demonstrated 
by its compatibility with various functional 


Wang et al., Science 381, 75-81 (2023) 7 July 2023 


groups (e.g., hydroxyl, silicon, phenyl, halogen 
groups) (Sai to 3ao). A larger-scale reaction 
successfully afforded product 3ai in 93% yield. 
Moreover, the scope was smoothly extended to 
a thiophene bearing an alkyne (Zap). An alkene 


group, which could have been reactive toward 
the photosensitizer, also proved compatible 
under standard conditions, giving product 
38av. Additionally, the cyclohexyl group was 
well tolerated, as shown by compound 3aq. 
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Fig. 3. Substrate scope for direct insertion of BCBs into benzothiophene derivatives. Yields are given for the isolated major products. Regioselectivity ratio is 
determined by 'H NMR analysis of the crude product mixtures. Reaction conditions: thiophene (1, 0.36 mmol, 3.0 equiv), BCB (2a, 0.12 mmol, 1.0 equiv), [Acr-Mes2]"[BF4]_ 
(8 mol %), argon, blue LEDs (450 nm), in MeCN (2 ml) for 12 hours at room temperature; *[Acr-Mes,]*[BF4] (16 mol %), 1 (4.0 equiv) for 24 hours. TBS, tert-butyldimethylsilyl. 


Substrates bearing pyridine and pyrimidine 
derivatives were amenable to this photochem- 
ical protocol, delivering corresponding products 
in excellent reaction yields (Sar to 3at). Fur- 
thermore, benzothiophene bearing an electron- 
withdrawing group proved to be a suitable 
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partner (3au). A substrate containing a men- 
thol fragment was also compatible (Saw). To 
further demonstrate the chemoselectivity and 
regioselectivity of this heterocycle enlarge- 
ment, complex molecules with two identical 
benzothiophene structures were treated with 


BCB (2a). Both reactions led to single-isomer 
products, respectively, demonstrating excep- 
tional chemo- and regioselectivity (ax and 3ay). 
In a substrate with both a benzothiophene and a 
thiophene moiety, the BCB reacted exclusively at 
the benzothiophene site (az). Stereochemical and 
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Fig. 4. Mechanistic studies. (A) Stern-Volmer luminescence quenching analysis. R*, coefficient of determination. (B) Cyclic voltammetry analysis. (C) Radical 
inhibitor experiment. (D) Quantum yield determination. (E) Computed reaction coordinate surface with 3-phenylthiophene. (F) Computed reaction coordinate surface 
with benzothiophene. DFT calculations were conducted at the UMO62X/def2TZVP/SMD(solvent = MeCN)//UM06-2X/def2SVP level of theory (48-52). 


regiochemical assignments across the series of 
products were based on crystal structure analysis 
of 3a, 3ag, and 3ar (CCDC 2250075-2250077). 


Mechanistic investigation 


We then turned our attention to the mecha- 
nism of this photoinduced ring enlargement. 
Stern-Volmer luminescence quenching analysis 
revealed that the excited-state photocatalyst 
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(PC*) is more easily quenched by thiophene 
1a than by BCB 2a, consistent with our orig- 
inal mechanistic hypothesis (Fig. 4A). Cyclic 
voltammetry was then performed to study 
the redox potentials of la and 2a (Fig. 4B). 
An oxidation peak of la in MeCN was ob- 
served at 1.68 V [versus saturated calomel 
electrode (SCE)], and no oxidation peak of 
2a was detected. These results are again con- 


sistent with oxidation of la but not 2a by the 
PC* (oxidation potential E,, = 2.00 V versus 
SCE) (46). Moreover, no desired product was 
generated in the presence of the radical 
scavenger 2,2,6,6-tetramethyl-1-piperidinyloxy 
(TEMPO), thereby suggesting a radical pathway 
(Fig. 4C). The quantum yield ® of the reaction 
between 1a with 2a was determined to be 0.87 


(Fig. 4D). Further intermolecular competition 
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experiments indicated that electron-rich het- 
erocycles are more reactive than electron-poor 
heterocycles, selectively providing corre- 
sponding products (see supplementary mate- 
rials for details). 

Additionally, we performed density func- 
tional theory (DFT) calculations to gain deeper 
insight into the ring enlargement mechanism 
(Fig. 4, E and F, and figs. S9 and S10). The 
experimental studies suggest that the PC* 
oxidizes la to form the radical cation inter- 
mediate [1a]"*. Our DFT calculation revealed 
the highest spin density on C2 and the high- 
est Hirshfeld charge on the benzylic carbon, 
which rationalize the observed BCB insertion 
at C2 (Fig. 4E). The BCB insertion [TS1]"* is 
the rate-limiting transition state with a free- 
energy barrier of 12.6 kcal/mol and offers the 
thermodynamically more stable [IM1]"*. Sub- 
sequent C-S bond formation to produce the 
heterocycle (([TS2]**) is both kinetically facile 
and thermodynamically favored. Reduction 
of [IM2]™* by the photocatalyst is thermo- 
dynamically favored by 11.5 kcal/mol. The 
final C-S bond cleavage (TS3) is kinetically 
facile, and the generation of product 3a is 
irreversible. 

Alternatively, because benzothiophene ex- 
pansion takes place with inverted regioselec- 
tivity to forge 3ah, we hypothesized that the 


3bh, 92% yield 


3bg, 92% yield O 


spin density distribution of the radical cation 
intermediate [1ah]"* is different from that of 
[1a]’*. Indeed, our calculation showed that 
the spin density and Hirshfeld charges are 
evenly distributed over the S-heterocycle. More- 
over, the reaction mechanism with benzo- 
thiophene (Fig. 4F) is different than that with 
3-phenylthiophene (la). The BCB insertion 
(TS4) into the S-radical has a similar free- 
energy barrier of 12.9 kcal/mol, as compared 
with TS1. The formation of INT-5 is en- 
dergonic by 7.2 kcal/mol. Furthermore, the 
subsequent C-C bond formation is the rate- 
limiting transition state with a free-energy 
barrier of 16.1 kcal/mol. Generation of the fused 
heterocycle INT-6 radical cation is exergonic 
by 10.1 kcal/mol. Although we could not locate 
the reduced [IM4], we expect the C-S bond 
cleavage to obtain the product 3ah to be ki- 
netically facile. 


Product elaboration 


Next, synthetic derivatization of the eight- 
membered sulfur-based bicyclic motifs was 
investigated (Fig. 5; see supplementary ma- 
terials for details). Starting from compound 
3a, aldehyde 3ba was successfully produced 
in good yield through selective amide reduc- 
tion with THF as solvent. Since sulfone and 


sulfoxide groups are widely used in organic 


Ph 


DMAP 
EDCs*HCl 


oe Ph 


ate 
82% yield 


Rh(PPh3)3Cl 


H» (10 bar 
6 2 ( ye 


3bf, 87% yield 


3bb, 85% yield 


CpoZr(H)Cl Ino 


synthesis, chemoselective oxidation of 3a was 
carried out and successfully gave sulfoxide 
3bb and sulfone 3be. Tertiary amine 3bd could 
be produced in 64% yield through the reduc- 
tion of the amide carbonyl group with the 
preservation of the carbon-carbon double bond. 
Notably, selective hydrogenation of alkene and 
diene within 3a offered desired products 3be 
and 3bf in 76% and 87% yields, respectively, 
which provides more sulfur-based bicyclic com- 
pounds with different properties. Subsequently, 
with the use of compound 3ai as a starting 
material, substitution with 3-bromoprop-1-ene 
provided allyl ether 3bg by introducing an 
important allyl group. Compound 3ai was also 
converted to aldehyde 3bh and complex ester 
3bi by means of selective oxidation and ester- 
ification, respectively. These successful trans- 
formations of various functional groups further 
illustrate the potential applicability of these 
sulfur-based bicyclic motifs. . 
Given their wide applications in synthetic 
and medicinal chemistry, we used thiophene 
derivatives as the reaction substrates to start 
this ring-opening insertion protocol. Addition- 
ally, because medium-sized bicyclic scaffolds 
with a nitrogen and oxygen are also prevalent 
in both naturally occurring and synthetic bio- 
active molecules, we believe that this ring-in-a- 
heterocycle protocol will facilitate and inspire 


Rh(PPh3)3Cl Ph 
Hp» (1 atm) 


3bd, 64% yield 


3be, 76% yield 


Fig. 5. Synthetic derivatization. Reaction conditions for 3ba: Cp2Zr(H)Cl 

(0.2 mmol), 3a (0.1 mmol), THF (2.5 ml); for 3bb: H202 (0.125 mmol), 3a 

(0.1 mmol), dichloromethane (DCM, 0.2 ml), trifluoroacetic acid (TFA, 0.2 ml); for 
3be: 3-chloroperoxybenzoic acid (m-CPBA, 0.4 mmol), 3a (0.1 mmol), DCM 

(2 ml); for 3bd: LiAIH, (0.744 mmol), 3a (0.1 mmol), THF (2 ml); for 3be: Rh(PPhs)3C! 
(0.015 mmol), 3a (0.1 mmol), Hz (1 atm), PhMe (0.5 ml), 9 hours; for 3bf: 
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Rh(PPh3)3Cl (0.015 mmol), 3a (0.1 mmol), Hz (10 bar), PhMe (0.5 ml), 5 hours; for 
3bg: NaH (0.096 mmol), allyl bromide (0.12 mmol), 3ai (0.08 mmol), DMF (1.3 ml); 
for 3bh: pyridinium chlorochromate (PCC, 0.15 mmol), 3ai (0.1 mmol), DCM 

(0.5 ml); and for 3bi: 3ai (0.1 mmol), probenecid (0.12 mmol), 4-dimethylaminopyridine 
(DMAP, 0.12 mmol), 1-(3-dimethylaminopropyl)-3-ethylcarbodiimide hydrochloride 
(EDC-HCl, 0.15 mmol), DCM (0.5 ml). 
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the development of skeletal ring enlargements 
of such heterocyclic compounds through in- 
sertion of small rings. 
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QUANTUM SIMULATION 


Direct observation of nonlocal fermion pairing in an 
attractive Fermi-Hubbard gas 


Thomas Hartke*, Botond Oreg, Carter Turnbaugh, Ningyuan Jia, Martin Zwierlein 


The Hubbard model of attractively interacting fermions provides a paradigmatic setting for fermion 
pairing. It features a crossover between Bose-Einstein condensation of tightly bound pairs and 
Bardeen-Cooper-Schrieffer superfluidity of long-range Cooper pairs, and a “pseudo-gap” region where 
pairs form above the superfluid critical temperature. We directly observe the nonlocal nature of 
fermion pairing in a Hubbard lattice gas, using spin- and density-resolved imaging of ~1000 fermionic 
potassium-40 atoms under a bilayer microscope. Complete fermion pairing is revealed by the vanishing 
of global spin fluctuations with increasing attraction. In the strongly correlated regime, the fermion 
pair size is found to be on the order of the average interparticle spacing. Our study informs theories of 
pseudo-gap behavior in strongly correlated fermion systems. 


ong-range Cooper pairs form in a Fermi 
gas for even the weakest attraction be- 
tween fermions. With increasing inter- 
action, fermion pairs become more tightly 
bound, as the system undergoes a smooth 
crossover from Bardeen-Cooper-Schrieffer (BCS) 
superfluidity toward a Bose-Einstein conden- 
sate (BEC) of molecular pairs (7-3). In the BCS 
limit, pair formation and the onset of super- 
fluidity occur at the same temperature, but in 
the crossover, pairs are expected to form at 
temperatures above the critical temperature 


Fig. 1. Atom-resolved detection of an attractive 
Fermi-Hubbard gas. (A) Qualitative phase diagram 
of the attractive Fermi-Hubbard model versus 
on-site attraction U/t and temperature T/t at 
density n=0.8 (9-13). Below a critical temperature 
T,, attractive fermions form a BCS or BEC 
superfluid (SF). In the pseudo-gap regime between 
T, and pairing temperature T*, accessed in this 
work (white shading), increasing attraction 

drives pair formation, with pairs exhibiting charge 
density wave (CDW) and superfluid correlations. 
(B) Measured doublon density d (circles) at 

fixed density n versus U/t, from the noninteracting 
limit d = (n/2)* triangles) to the fully paired 
limit d = n/2 (squares), with representative 
images of the full density in ~20 x 20 site regions 
shown above. (C) Snapshot of full spin-and- 
density readout of a strongly correlated gas at 
U/t = 8.4(4) and T/t = 0.36(5). The spin up 
(blue), spin down (red), and combined images 
(rightmost panel) are obtained via bilayer quantum 
gas microscopy (23, 25). 
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T. for superfluidity; the onset pair-formation 
temperature is usually called T*. In this so- 
called “pseudo-gap” regime, the pair size should 
be on the order of the interparticle spacing and 
pairing strongly affected by many-body effects 
(4, 5). The character of this strongly correlated 
regime, situated between a Fermi liquid and 
a normal Bose liquid, is a matter of debate, 
whose resolution should affect understanding 
of other strongly coupled fermion systems, 
such as the high-T, cuprates and twisted bi- 
layer graphene (6-8). The rich physics of the 
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BEC-BCS crossover is captured by the attra, cect 


Fermi-Hubbard model, a spin-1/2 gas of-- 
mions hopping on a lattice with on-site inter- 
actions between unlike spins (9-7). Through 
a particle-hole transformation, it stands in 
one-to-one correspondence with the repulsive 
Hubbard model (78, 19), which under charge 
doping is believed to hold the key toward un- 
derstanding high-temperature superconduc- 
tivity. The model can be realized using neutral 
fermionic atoms in optical lattices with tun- 
able interactions. Recent investigations have 
found spectral gaps (20), correlations between 
local pairs (27), and evidence for interspin cor- 
relations from density profiles (22). 

In this work, we observe the formation and 
spatial ordering of nonlocal fermion pairs in the 
pseudo-gap regime of an attractive Hubbard 
gas confined to two dimensions. We use bilayer 
quantum gas microscopy to detect the in situ 
location and spin of each fermion in every exper- _ 
imental shot (23-25). Access to microscopic spin 
and density correlations reveals the formation 
of nonlocal pairs, the development of long- 
range spatial correlations between pairs, and 
the interplay of pair fluctuations with this 
density-wave order. 
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Simulating the attractive Fermi-Hubbard model 
The phase diagram of the attractive Fermi- 
Hubbard model is shown in Fig. 1A as a func- 
tion of the attractive on-site interaction strength 
U, tunneling amplitude ¢t, and temperature 
T (9-13). For weak attraction U « t, a BCS 
superfluid of long-range fermion pairs forms 
with 7, = T*, reflecting the exponentially 
weak pair binding. In the opposite limit of 
strong attraction U > ¢, all fermions are bound 
into local on-site pairs below a dissociation 
temperature 7* ~U. These pairs condense 
at the critical temperature of Bose-Einstein 
condensation T,, proportional to the pair 
density n, and pair tunneling rate t, ~ ¢?/U. 
A peak of the condensation temperature 
T,/t = 0.2 is expected to occur at U/t ~ 6 and 
density n ~ 0.8 (9-13). Above the transition 
temperature, superfluid correlations compete 
with the formation of a checkerboard charge 
density wave (CDW) (2]). At half-filling (den- 
sity nm = 1), this competition persists down 
to T = 0 and prevents condensation. In this 
work we use a filling n= 0.8, staying in a 
regime where the ground state is a paired 
superfluid (9, 75). 

We here experimentally realize the attractive 
Fermi-Hubbard model using a two-species gas 
of degenerate fermionic *°K atoms trapped 
within an optical lattice (23, 25). The Hubbard 
tunneling amplitude ¢ is controlled by the 
lattice depth, and the interaction strength U is 
tuned via the magnetic field. A bilayer quan- 
tum gas microscope reveals the location of each 
fermion. 

As a first measure of strong pairing in the 
attractive Hubbard gas, we detect the density 
d of doublons (doubly occupied lattice sites) for 
increasing interaction strength U/t across the 
phase diagram in Fig. 1A. At fixed density n, d 
increases from the noninteracting limit d = 
(n/2)° of random encounters of unlike spins to 
the fully paired limit d = n/2 (Fig. 1B) (17). At 
intermediate attraction, strong checkerboard 
ordering of doublons is observed, shown in 
Fig. 1C at U/t = 8.4(4) and T/t = 0.36(5). 

Multiple neighboring sites containing a sin- 
gle spin up and spin down are present among 
doublons in Fig. 1C. These correlated pairs of 
single spins are evidence of the nonlocal na- 
ture of fermion pairs. The microscopic mech- 
anism is the virtual dissociation of a doublon 
into spatially separate pairing partners, with 
matrix element t¢ and intermediate energy cost 
U, which perturbatively lowers the energy of 
a pair by 4¢?/U. Because pairs are composed 
of fermions, dissociation can only occur if a 
nearby site does not already contain a like spin. 
This leads to effective nearest-neighbor repul- 
sive interactions between pairs (23), which 
in turn are the source of long-range CDW order. 
The presence of these delocalized pairs also 
demonstrates that the doublon density d is an 
incomplete measure of pairing. 
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Fig. 2. Observation of nonlocal fermion pairing. (A) Experimental snapshots of the Ferm 
U/t = 5.8(3), and U/t = 8.4(4) (left to right), showing the formation of nonlocal pairs and on-site pairs with ¢ 
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increasing attraction. Schematics above highlight the physics dominating spin correlations in each image, 
and shaded bonds suggest possible pair correlations. (B) Correlation maps (r7iM;.5), of the magnetization 
m=n, —n, at various U/t. (C) Total magnetization fluctuations > (MiMi+s), (blue circles) and on-site 


é 


fluctuations (black squares) versus U/t. Total fluctuations equal the product of magnetic susceptibility x,,, and 
temperature T via the fluctuation-dissipation theorem (23). Vanishing total spin fluctuations for U/t=6 
(orange shading) indicate full pairing and vanishing y,,. Blue shading shows quantum Monte Carlo simulations 
of total fluctuations at n = 0.85, from T/t = 0.3 to T/t = 0.4 (23). The pairing temperature T* crosses 

T = 0.35t at U/t = 2.5. Nonlocal pairing is reflected in the singlon fraction per total density s/n (upper inset), 
which scales as ~8t?/U? (gray line) at large attraction. Fluctuations at U/t = 5.8(3) (lower inset) extend 
beyond the interparticle spacing 1/,/mn; (dotted line). All data and error bars are obtained from 
bootstrapping greater than 50 images of atomic clouds with imaging loss correction (23). 


Detecting fermion pairing 

A true signature of pairing that accounts for 
these nonlocal pairs is the vanishing of total 
spin fluctuations. Indeed, a system in contact 
with a surrounding particle bath will generally 
display fluctuations of the total magnetiza- 


tion M = Sinai), where the magnetization 
i 


m =n; — n,. However, pair formation sup- 
presses spin fluctuations, as pairs do not 
contribute to M, and thus in a fully paired 
system the variance o;, vanishes. This variance 
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Fig. 3. Charge density wave ordering of pairs. (A) Density correlations (nifi,s), reflect the crossover from 
a noninteracting gas (left), to a fully paired gas with CDW order (center), to a weakly ordered gas of 

local pairs (right). (B) At U/t = 8.4(4), the nonlocal rectified correlations (Aijfiss).(—1)°** are well 
described by long-range exponential decay. Dotted line shows the interparticle spacing 1/,/nn;. (C) The 


density response y,, (red circles) and interspin response x, (blue squares) at wave vector k = (x, 7) Serve as 
order parameters for CDW correlations. These susceptibilities are obtained via the fluctuation-dissipation 


theorem Xa(K = (/T) D> (ins) cos(k - 8) and xy, (K )2 (1/7) > (An fiss)_c08(k 8) (23). Inset: 
a = 5 8 
Xn (K) versus k at U/t = 0 (black) and U/t = 8.4(4) (red). (D) Fluctuation thermometry: The measured 


density fluctuations 2 (fifa. 


), (top) and uniform compressibility On/Op = x, (k = (0,0)) (middle) 


are combined to directly obtain the temperature T/t (bottom). 


is measured locally in our quantum gas mi- 
croscope through the sum of connected cor- 
relations o3,/(Area) = >» (MiMi+s)., Where 
= (thitniss) (si) (triss). 

The magnetization fluctuations are directly 
connected to the magnetic susceptibility y,,, = 
Om/Oh, the response of the magnetization to a 
global magnetic field h, through the fluctuation- 
dissipation theorem ¥,,T = >, (MiMi+s) ¢ (23). 

5 
An energy gap for spin excitations, which ex- 
ponentially suppresses excess spins and thus 
» (™M;Mi+5), also exponentially suppresses 


1, (26). 

Figure 2 reports a crossover to full fermion 
pairing beyond an interaction strength U/t ~ 6 
at n = 0.8(1) and T/t = 0.35(5), determined 
by in situ observation of magnetization fluc- 
tuations. The reduction in fluctuations is in 
good agreement with theoretical predictions 
for these parameters (12-/4). Figure 2A high- 
lights the physical mechanisms that determine 
spin fluctuations at various U/t. At vanishing 
interactions, Pauli exclusion separately reduces 
the density fluctuations of each spin, and there- 


(MiMi+s) ¢ 
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by also reduces total spin fluctuations. With 
increasing attraction, nonlocal pairs form in 
which spins are subject to a competition of 
Pauli exclusion and attraction, while deep in 
the on-site pair regime spin fluctuations reflect 
virtual hopping onto neighboring sites. From 
statistical averages over more than 50 spin 
configurations, as shown in Fig. 2A for each in- 
teraction strength, we obtain the two-dimensional 
(2D) magnetization correlation maps (772;77:+8) ¢» 
shown in Fig. 2B. To detect pairing, Fig. 2C 
presents the sum of these correlation maps, the 
total magnetization fluctuations, which are 
fully suppressed beyond U/t=6. Already at 
zero interactions, Pauli exclusion reduces total 
fluctuations by 68(5)% compared to the high- 
temperature expectation n(1— 7/2). This re- 
flects the substantial degeneracy of the Fermi 
gas (T/T ~ 0.1, where Ty is the Fermi tem- 
perature). Increasing attraction reduces mag- 
netization fluctuations further, and the fraction 
of unpaired spins is less than 1.5(1.8)% at 


U/t = 5.8(3), where > (7ititiss), gives the 


3 
density of unpaired spins. This full suppression 
is dual to the formation of a Mott insulator for 
repulsive interactions (18, 19, 23, 25). 


The suppression of fluctuations in Fig. 2C 
with increasing U/t signifies the development 
of an energy gap for spin excitations (26). The- 
ory predicts (10-14, 27, 28) a pairing tem- 
perature T* ~ 0.25U in the crossover regime 
(23). This predicted 7* crosses T ~ 0.35t near 
U/t = 2.5, explaining the near-complete sup- 
pression of fluctuations beyond U/t = 6. The 
corresponding expected spin excitation gap 
far exceeds the two-body binding energy Ey, 
highlighting the many-body nature of pairing. 

Within this regime of full pairing, the two 
atoms within a pair have a finite probability 
of being located on separate lattice sites be- 
cause of quantum fluctuations, in which case 
they appear as isolated single atoms (called 
singlons). This probability is measured by the 
ratio of the singlon density to the total density, 
s/n (Fig. 2C, upper inset). The observed scaling 
of s/n with t?/U? at strong attraction is ex- 
pected from perturbation theory already for . 
a Fermi-Hubbard double well (25, 29). At 
U/t = 5.8(3), the nonlocal portion of the pairs 
amounts to ~20%. The effective size of fermion 
pairs can be obtained as the spatial extent of 
nonlocal spin fluctuations. With full pairing at 
U/t = 5.8(3), spin fluctuations are present be- 
yond the single-spin interparticle spacing (Fig. 
2C, lower inset), indicating that fermion pairs 
overlap substantially. 


Measuring charge correlations 


Characteristic for the pseudo-gap regime is a 
predicted strong departure from Fermi liquid 
behavior, in which spin and charge fluctua- 
tions are similar (4, 5). Having established the 
existence of nonlocal fermion pairs through 
vanishing magnetization fluctuation, we there- 
fore now explore charge (i.e., density) correla- 
tions of the gas. Whereas for weak interactions 
charge and spin correlations are closely asso- 
ciated, for stronger attraction we instead find 
spatial ordering into a CDW across the phase 
diagram of Fig. 1A. Previously, evidence for 
CDW order has only been observed in doublon- 
doublon correlations and at a fixed interaction 
strength (27). In Fig. 3A, beginning without 
interactions, we observe negative nonlocal den- 
sity correlations (7775), for nearest-neighbor 
and diagonal correlations. These correlations 
are a simple and direct manifestation of the 
Pauli hole experienced by a single spin (25, 30). 
Indeed, here (7;7;+5), is precisely equal to 
(Rit Nt) + (Nil Nj|) ¢ because the measured in- 
terspin correlations (7;7;,),. vanish (23). For 
increasing attraction, the Pauli hole gives way 
to the positive checkerboard long-range density 
correlations, shown versus distance in Fig. 3B 
at U/t = 8.4(4). Further increase in U/t re- 
duces the observed CDW strength, likely as a 
result of smaller effective repulsion between 
more-localized pairs (Fig. 2B). Notably, we 
measure a negative sum of nonlocal inter- 
spin correlations (7;;7:+3,), for any attractive 
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Fig. 4. Interplay of nonlocal pairs and many-body order. (A) When local pairs in a strongly correlated 
attractive system virtually fluctuate into spatially separated single spins, further tunneling can disturb the 
underlying many-body ordered state. (B) Polaronic correlations are observed by measuring disturbances of 
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peak in the background CDW strength is observed near U/t = 8 for various displacements 3 (left panel), 
strong reduction in CDW strength is observed after conditioning on the presence of a nearby isolated spin 
(right panel). Right inset shows the relative location of the single spin (purple circle) and CDW bond. 
relative change in CDW strength Acpw reveals the spatial extent of polaronic correlations at U/t = 8.4(4). 
Acpw decays with distance |r| from the singlon to the CDW bond. The shaded region represents the extent of 
he disturbance to the background order. (D) A symmetrized 2D map of (C), with lattice sites represented by 


C) The 


black dots and the isolated spin at center. In the CDW strength, di (hi) denotes a doublon (hole, or empty site). 


interaction (23), revealing that a single t atom 
in total repels spin | atoms on all other sites. 
This constitutes a strong direct signature of 
effective repulsion between pairs. 

The development and destruction of CDW or- 
der across the phase diagram of Fig. 1 can be cap- 
tured by the density response x,, at wave vector 
k = (n,n) (Fig. 3C), which reflects the preva- 
lence of low-energy states with checkerboard 
order. Highlighting the power of quantum gas 
microscopy, this thermodynamic property can be 


Hartke et al., Science 381, 82-86 (2023) 7 July 2023 


measured in equilibrium using the fluctuation- 
dissipation theorem for density perturbations, 


wn ( \= (1/T) oD (Rifts) 008 (H . 3) , and 


3 
the measured uniform density compressibility 
Xn (Xe = (0,0)) = dn/du (Fig. 3D) (25). This 
same method ‘allows measurement of y,, t) 
throughout the Brillouin zone (Fig. 3C, inset) 
and provides a model-independent measure- 
ment of temperature T (Fig. 3D) (25, 37). The latter 
enables obtaining the magnetic susceptibility 


%m from the measured spin fluctuations with- 
out applying a magnetic field (23, 32). As ex- 
pected from the phase diagram in Fig. 1, the 
peak in CDW order occurs near U/t = 6. Also 
displayed are the interspin correlations, obtained 


from y,, (7) = (1/7) (ft ftiss))008(-8). 


Although opposite spins are uncorrelated at 
U/t = 0, they are seen to almost fully carry the 
CDW order beyond U/t ~ 6. Because density, 
magnetization, and interspin correlations are re- 
lated by (?;7i+5). — (MiMi4s). = A(Nip Nits)» 
the relative agreement of x, and x, , illustrates 
the strength of density order as compared to 
magnetic order at k = (x,7). The pronounced 
CDW peak is also a signature of strong super- 
fluid correlations within the crossover regime, 
as CDW correlations away from half-filling 
serve as a lower bound for superfluid correla- 
tions (21, 33). 


Polaronic correlations 


Given simultaneous charge and spin measure- 
ments, we finally explore the interplay of non- 
local pair fluctuations and the CDW order of 
other pairs, revealing the existence of polar- 
onic correlations in the CDW order of the 
attractive Hubbard model. Polaronic corre- 
lations occur in the regime of highly nonlocal 
pairs, where further tunneling of a separated 
pair can dislocate the CDW checkerboard or 
flip the sign of superfluid correlations (Fig. 4A). 
These tunneling events prevent the virtual 
delocalization of other pairs across the bonds 
where the order has been reversed, costing 
an additional 4¢?/U per bond in the strong- 
coupling limit and further confining spatially 
separated pairs (34). This mechanism is di- 
rectly complementary (18, 19, 23) to the mag- 
netic polaron mechanism of the repulsive 
Fermi-Hubbard model (35, 36), though here 
polaronic correlations dress the individual 
spins of a spatially separated fermion pair, 
rather than excess dopants. 

In Fig. 4B, we compare the CDW correlations 
surrounding single spins to those present in the 
background. We quantify the underlying CDW 
strength as ((d; — hi) (diss - hiss) {1 
for a given displacement 6 , which is positive for 
any 6 for a gas possessing checkerboard doublon- 
hole correlations. This underlying CDW strength 
peaks near an interaction strength U/t = 8. By 
contrast, for various U//t values, the measured 
CDW strength is strongly reduced after con- 
ditioning on the presence of a single nearby 
isolated spin. This reduction substantially ex- 
ceeds the lowest-order expectation of single- 
pair fluctuation events depicted in Fig. 4A, e.g., 
25% for a displacement 6 = (0,1) and50% for 
8 = (1,1) or (2, 0). The measurements directly 
reveal the spatial extent of these polaronic effects, 
captured by the relative change Acpw(|7'|) of 
conditioned to unconditioned CDW strength 
(Fig. 4, C and D). Virtual pair fluctuations disturb 
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the CDW order over a range of ~2 sites at U/t = 
8.4(4), with complete reduction or even re- 
versal of the CDW order on nearby bonds. In 
future work, measurements of four-point cor- 
relations (36) around pairs of spins will further 
elucidate the internal structure of these quan- 
tum fluctuations. 


Conclusions 


Our real-space observations of nonlocal fermion 
pairing and its interplay with CDW order il- 
lustrate the richness of the pseudo-gap regime 
of the attractive Hubbard model. Similar com- 
peting or intertwined orders are predicted for 
the repulsive Hubbard model. The methods 
can be extended further to study polaronic 
physics and superfluidity (37), pairing in mo- 
mentum space as measured in bulk 2D gases 
(38), to detect the m phase shift of CDW order 
across stripes (39), and to directly measure the 
BCS condensate fraction through pair correla- 
tions (40). 
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Paleoenvironments shaped the exchange of 
terrestrial vertebrates across Wallace’s Line 


A. Skeels'?3*, L. M. Boschman’, |. R. McFadden’, E. M. Joyce®, 0. Hagen’, O. Jiménez Robles**, 
W. Bach??, V. Boussange’, T. Keggin’, W. Jetz®?°, L. Pellissier?? 


Faunal turnover in Indo-Australia across Wallace’s Line is one of the most recognizable patterns in 
biogeography and has catalyzed debate about the role of evolutionary and geoclimatic history in biotic 
interchanges. Here, analysis of more than 20,000 vertebrate species with a model of geoclimate and 
biological diversification shows that broad precipitation tolerance and dispersal ability were key for 
exchange across the deep-time precipitation gradient spanning the region. Sundanian (Southeast 
Asian) lineages evolved in a climate similar to the humid “stepping stones” of Wallacea, facilitating 
colonization of the Sahulian (Australian) continental shelf. By contrast, Sahulian lineages predominantly 
evolved in drier conditions, hampering establishment in Sunda and shaping faunal distinctiveness. We 
demonstrate how the history of adaptation to past environmental conditions shapes asymmetrical 


colonization and global biogeographic structure. 


he world’s major biotic interchanges have 

had a disproportionate impact on the dis- 

tribution of the world’s terrestrial fauna 

(1-3). Well-known systems include the 

American interchange, which followed 
the formation of the Panamanian isthmus be- 
tween North and South America (4), and the 
Indo-Australian interchange, which followed 
the convergence of the Australian and Eur- 
asian tectonic plates and was shaped by com- 
plex dynamics of island formation (5). Far 
from an even diffusion of species between re- 
gions, interchanges have displayed an asym- 
metry in the direction of exchange (6-8) and 
have been dominated by particular functional 
traits (9, 10). This unevenness may reflect a 
predisposition for dispersion in particular 
groups of organisms, because successful colo- 
nization is dependent on the ability both to 
disperse to a new region and to establish 
within new environmental conditions and 
ecological communities (17). Dispersal ability 
may act as a primary filter on colonization 
success (72), but lineages with broad environ- 
mental tolerances, or those with the ability 
to adapt to new environments, might contrib- 
ute more to an interchange in the presence 
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of environmental gradients between regions 
(9, 13). To date, our ability to distinguish be- 
tween hypotheses regarding the processes 
that shape the unevenness of different biotic 
interchanges has been limited by the avail- 
able process-based methodologies and paleo- 
environmental reconstructions. 
Distributional patterns of terrestrial verte- 
brates in the Indo-Australian archipelago have 
fascinated naturalist for centuries (14) and 
have provided a model system for understand- 
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ing biotic interchange. As the Australian oot 
tinental plate approached the Eurasian p!u--, 
subduction at the northern boundary led to 
the formation of a geologically complex archi- 
pelago, today composed of thousands of islands 
and known biogeographically as Wallacea (5) 
(Fig. 1). The oceanic boundary separating the 
Sunda continental shelf (including Myanmar, 
Cambodia, Vietnam, Laos, Thailand, Malaysia, 
and Indonesia west of Lombok on the Eurasian 
plate) from Wallacea and the Sahul continental 
shelf (including Australia and New Guinea on 
the Australian plate) is named Wallace’s Line 
after Alfred Russell Wallace (J4) (Fig. 1), who 
noted a clear disjunction in the distribution of 
some taxa across it (14, 15). Today, Wallace’s 
Line marks the boundary of the Indomalayan 
and Australasian zoogeographic realms (2, 16) 
(Fig. 1), but there are many lines in the region 
that mark turnover between a predominantly 
Sundanian or Sahulian fauna (75), such as the 
Heilprin-Lydekker Line (Fig. 1). The persistent 
presence of oceanic barriers has led to the 
dominant idea that dispersal limitation is the 
primary process shaping interchange (72). For 
example, the mammalian faunas of Sunda 
and Sahul are largely endemic, and only vagile 
rodents (17) and bats (18) have been successfully ‘ 
exchanged between continents. However, envi- 
ronmental niche traits may also have played a 
role (19, 20) as the distinctly drier Australian 
continent (20, 21) shaped the evolution of + 


Heilprin- Feel Line Mana and temperate 


Fig. 1. Climate and geography of the Indo-Australian archipelago at present. The Sunda continental 
shelf is part of the Indomalayan zoogeographic realm and is separated from the Australasian zoogeographical 
ealm by Wallace's Line. Wallacea is the island archipelago between the Sunda and Sahul continental shelves. 
The Sahul continental shelf, which includes the Australian mainland and New Guinea, is separated from 
Wallacea by the Heilprin-Lydekker Line. Sunda, Wallacea, and New Guinea are dominated by the tropical 
ainforest KOppen-Geiger climate belt, except for eastern Java, Bali, South Sulawesi, the Lesser Sunda 
isles, and southern New Guinea, which are dominated by the drier tropical savanna climate belt. The 


Australian mainland is dominated by tropical savanna, arid, and temperate climate belts. The climate maps 


are adapted from (48). 
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an arid-adapted fauna (22). The interchange 
has been highly asymmetrical, with rates of 
exchange estimated to be at least twice as 
high from Sunda to Sahul than in the oppo- 
site direction (23). This could be shaped by the 
uneven sizes of the species pools, as stable 
areas of tropical rainforest in Sunda, connected 
to the Eurasian continent, supported a bio- 
diverse fauna throughout the Cenozoic (24). 
Conversely, increasing aridity and a less connected 
continental area bound by oceans may have 
led to a reduced species pool in Sahul. Our 
current understanding of interchanges is pat- 
tern oriented, with historical biogeographic 
analyses determining where, when, and in 
which direction exchanges occur (2, 12, 15, 23, 25). 
A process-oriented understanding, however, 
requires the synthesis of paleogeographic 
and paleoclimatic information with biological 
traits and diversification histories across a 
range of taxa. 

Paleoenvironmental and biological informa- 
tion united under a mechanistic modeling 
approach makes it possible to compare empir- 
ical observations with theoretical expectations 
and provides a framework with which to ex- 
plore the mechanisms behind one of the 
world’s major biotic interchanges. We hypoth- 
esize that, beyond a simple dispersal filter 
across oceanic barriers, paleoenvironmental 
niche dynamics also influence patterns of ex- 


change. In addition to each clade’s dispersal 
ability (72) and the size of the species pool, 
exchange patterns should be related to either 
the variability in the environmental conditions 
in which species occur (25) (mean realized 
niche breadth) or the rate at which environ- 
mental niche traits change over time (73) 
(rates of niche evolution). Further, the spa- 
tial distribution of climate, and how it has 
changed through time, should also explain 
asymmetry and the spatial distribution of 
lineages with either Sundanian or Sahulian 
ancestry (19). Here, we combined paleoenvi- 
ronmental reconstructions of temperature, 
precipitation, and plate tectonics over the past 
30 million years (Ma) (26-28) with a mecha- 
nistic model of biodiversity (29) to explore 
how dispersal and niche traits interact with 
the environment to shape regional diversifi- 
cation and patterns of exchange through time 
(30). The selected time period captures the on- 
set of the convergence of the Australian and 
Eurasian plates (5) and the origin of most ter- 
restrial vertebrate families in the region (fig. S1). 
We used published, well-sampled phylogenetic 
and spatial information on 20,433 species be- 
longing to all 227 families of terrestrial vertebrates 
present in the Indo-Australian archipelago 
to estimate phylogenetic and taxonomic f di- 
versity across Wallace’s Line as measures of 
compositional dissimilarity between the two 


regions, as well as the number and direction 
of colonization events (37). 


Trait-based drivers of faunal exchange 


We estimated a minimum of 381 independent 
colonization events across Wallace’s Line based 
on a conservative phylogenetic hypothesis for 
each family [figs. S2 to S5 (30)], explaining the 
widespread distribution across Sunda and 
Sahul of 87 out of 227 terrestrial vertebrate 
families (Fig. 2 and Fig. 3A). Older colonization 
events were associated with greater species 
diversity (Spearman p = 0.68; Fig. 2A), most 
likely because these older lineages had a longer 
time to diversify. Precipitation niche breadth 
was the most consistent predictor of exchange in 
the Indo-Australian archipelago, indicating that 
colonization is more common in lineages that 
can tolerate wider variation in precipitation. 
Using phylogenetic multiple regression (table 
SI), we found that families with broader pre- | 
cipitation niche breadths were more likely 
to be exchanged between regions, to have a 
larger number of exchange events, and to have 
lower taxonomic B diversity and phyloge- 
netic f diversity (Fig. 3B and table S1) across 
Wallace’s Line. Temperature niche breadth 
was negatively associated with exchange dy- ‘ 
namics (Fig. 3B and table S1), because tem- 
perature and precipitation niche breadths 
are inversely related in vertebrates (32). The 
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Fig. 2. Colonization dynamics of terrestrial vertebrates and geoclimatic 
changes in Indo-Australia. (A) Bars show the log-transformed number of 
species derived from independent colonization events across Wallace's Line from 
Sunda to Wallacea (blue) or in the opposite direction (yellow). Dashed lines 
show the number of colonization events through time. (B) Paleogeography 

(28) and paleoclimate (26, 27) of the Indo-Australian archipelago over the past 
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30 Ma (26, 28). (C) Numbers of families crossing between regions across 

two different biogeographic boundaries. Principle component analysis was 
performed on paleotemperature and paleoprecipitation to decompose a single 
climate axis (ClimPCl), explaining 88% of the variation in climate, where similar 
colors represent similar climates. Warm, wet conditions are shown in red; cold, 
dry conditions are shown in blue. 
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mechanistic model enabled the disentangle- 
ment of these factors and validated the role 
of broader precipitation niche breadths in 
the rate of exchange (table S2). Niche breadth 
may affect colonization dynamics through two 
pathways: (i) Clades more tolerant of a broad 
range of conditions may be less prone to ex- 
tinction and therefore have a larger number 
of potential dispersing lineages (8), or (ii) they 
may be more able to establish in a diverse 
range of environmental conditions with greater 
establishment success. Our mechanistic model 
supports the second pathway, because higher 
rates of colonization were not associated with 
lower rates of extinction (fig. S6), but instead 
associated with certain dispersal and niche 
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Fig. 3. Exchange dynamics and precipitation niche breadth of terrestrial 
vertebrates on a family-level phylogenetic tree and effect size of the 
predictors of exchange. (A) The circle in the center displays each family’s 
reconstructed origin as Sunda, Sahul, or uncertain (equivocal support) based on 


TF. sa 14 7 
Sern 
gyal 4 Laity ” ae 


characteristics of the modeled lineages. Al- 
though the simulation results cannot exclude 
the role of differential extinction on coloni- 
zation dynamics [e.g., (8)], they do suggest that 
differences in the rate of lineage diversifica- 
tion are not required to explain asymmetrical 
exchange. These results support the hypoth- 
esis that the precipitation gradient between 
Sunda and Sahul is a sharp barrier to the estab- 
lishment of range-expanding lineages (10, 3.3), 
and that lineages able to persist in a diversity 
of conditions can overcome this barrier. 
Supplementing the effect of climatic niche 
breadth, our empirical and in silico model 
results support the importance of dispersal 
ability and the species pool in exchange pat- 


terns. More diverse biotas might facilitate in- 
terchange by having a greater number and 
variation of potential colonist species, increas- 
ing the number of potential colonization at- 
tempts (34), and this pathway is considered 
the null expectation in invasion biology (35). 
We found that the size of the source species 
pool was positively associated with exchange 
success and the number of colonization events 
in terrestrial vertebrates (Fig. 3B and table S1), 
as well as with all exchange metrics in the 
mechanistic model (table S2). As a key dis- 
persal trait, flight was negatively associated 
with phylogenetic f diversity (Fig. 3B), and 
most exchanged taxa were highly volant, with 
64% of families being either birds or bats, 
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11, Lagomorpha; 12, Probosoidea; 13, Diprotodontia; 14, Dasyuromorphia; 15, 
otoryctemorphia; 17, Monotremata; 18, Anura; 19, 
Caudata; 20, Gymnophiona; 21, Passeriformes; 22, Psittaciformes; 23, 
Falconiformes; 24, Piciformes; 25, Coraciiformes; 26, Bucerotiformes; 


biogeographic estimation models. The middle colored ring shows whether the 
family is present in both Sunda and Sahul and was reconstructed to have 
been exchanged across Wallace's Line. In the outer colored ring, the color and 
height of the bars represent the mean precipitation niche breadth of the 
family. Niche breadth was measured as the SD of mean annual rainfall across 
each species distribution (in millimeters per year). The numbered lines at the 
perimeter indicate the number of taxonomic orders, with representative taxa 
highlighted as silhouettes (images are from PhyloPic: http://phylopic.org/). 

1, Carnivora; 2, Pholidota; 3, Cetartiodactyla; 4, Perrissodactyla; 5, Chiroptera; 
6, Eulipotyphla; 7, Primates; 8, Scandentia; 9, Dermoptera; 10, Rodentia; 
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27, Trogoniformes; 28, Strigiformes; 29, Accipitriformes; 30, Charadriiformes; 31, 
Apodiformes; 32, Caprimulgiformes; 33, Eurypygiformes; 34, Suliformes; 35, 
Pelecaniformes; 36, Ciconiiformes; 37, Procellariiformes; 38, Sphenisciformes; 
39, Gruiformes; 40, Cuculiformes; 41, Otidiformes; 42, Phaethontiformes; 

43, Podicipediformes; 44, Columbiformes; 45, Galliformes; 46, Anseriformes; 47, 
Casuariiformes; 48, Squamata. (B) Coefficients of phylogenetic linear models 
of four exchange metrics with mean temperature and precipitation niche 
breadths, rates of temperature and precipitation niche evolution, dispersal ability, 
and species pool size. Predictors that were statistically significant based on 
phylogenetic linear models are in bold, and nonsignificant predictors are gray. 
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Fig. 4. Age of paleoclimatic space in the Indo- 
Australian archipelago and distribution of 
climatic niche centroids of terrestrial vertebrates. 
(A and B) Gridded representation of climate space 
defined by temperature and precipitation. Each tile in 
the grid represents a unique combination of these two 
variables. The tiles are colored by the age of that 
climate’s first appearance in Sunda (A) and Sahul (B). 
Climate spaces that have been present longer are in 
darker colors. The climatic space occupied by the 
“stepping stone” region of Wallacea during the past 
30 Ma is highlighted with an orange dashed line. This 
space has been present longer in Sunda than in Sahul. 
Symbols indicate the median climate niche centroid 
of species from Sahul (yellow) or Sunda (blue) that 
belong to families that have been exchanged (circle) or 
have not been exchanged (star). (C) Proportion of 
species of Sahulian origin in climate space. Species 
niche positions in climate space show that the area 
occupied by Wallacea is more densely filled with 
species from Sundanian families than with species 
from Sahulian families, and that species from both 
exchanged and nonexchanged Sundanian families 

are more commonly found in the climate space 
occupied by Wallacea. 
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accounting for 93% of the colonization events 
of Sunda. All exchange metrics were strongly 
positively associated with dispersal ability in 
the mechanistic model (table $2). We found 
that exchange of bird families across the Indo- 
Australian archipelago was proportionally 
higher than during other major biotic inter- 
changes (36), reflecting the greater propensity 
of volant taxa to disperse over the deep-sea 
oceanic barriers such as Wallace’s Line, which 
uniquely define the Indo-Australian interchange. 

Contrary to our prediction, rates of niche 
evolution were not commonly associated with 
exchange dynamics (Fig. 3B and tables S1 and 
82), possibly reflecting the widespread pheno- 
menon of niche conservatism, the tendency 
for lineages to retain their ancestral environ- 
mental niche through time (37), during trans- 
oceanic colonization (38). For instance, many 
families that we reconstructed as having a 
Sundanian origin, such as Colubrid snakes and 
Pteropodid bats, are typically restricted to 
similar biomes in Sahul. Instead of the ability 
to rapidly adapt to new environmental condi- 
tions, we show that broader precipitation niche 
breadths and higher dispersal ability were 
the primary biological traits that allowed line- 
ages to expand their geographic ranges in the 
face of oceanic barriers and a steep precip- 
itation gradient. 


Paleoclimate and exchange asymmetry 


We demonstrate that asymmetrical rates of 
exchange have been shaped by the distinctly 
different paleoclimatic histories of Sunda and 
Sahul. At least 46 families originating in Sunda 
(43% of all Sundanian families) crossed the 
Heilprin-Lydekker Line (Fig. 1 and table S3) 
to Sahul, whereas only 19 families originating 
in Sahul (24% of all Sahulian families) crossed 
Wallace’s Line to Sunda. A necessary step to 
colonization involves crossing Wallacea (Fig. 1). 
Our paleoenvironmental reconstruction, which 
combines paleotopography (28), reconstructed 
Koppen belts (27), and a global circulation 
model (26), show that the humid climate space 
occupied by Wallacea emerged in Sunda at 
least 20 and 30 Ma (Fig. 4A and fig. S7) ina 
period associated with the spread of tropical 
rainforests (21, 24). Conversely, in Sahul, areas 
with similarly humid conditions were present 
during the mid-Miocene (fig. S8) (39) and only 
became widespread in their contemporary lo- 
cation on Sahul at least 10 and 5 Ma (Fig. 4B), 
after the northward movement of Sahul and 
the uplift of New Guinea (20, 24). An almost 
uninterrupted humid climate throughout the 
Indo-Australian archipelago, from Sunda to 
New Guinea, facilitated the establishment of 
Sundanian lineages in Sahul, and this can 
also explain the observation that Wallacean 
and New Guinean flora are compositionally 
more similar to the flora of Sunda than to 
that of the Australian mainland (33). Support- 


ing this finding, we show that the climate 
space currently present in Wallacea and New 
Guinea is more densely occupied by vertebrate 
lineages from Sunda than by lineages from 
Sahul (Fig. 4C). By contrast, species from Sahul 
in families that did not colonize Sunda occupy 
an older and drier component of the Sahulian 
climate than species in families that colonized 
Sunda (Fig. 4C and fig. S9). A higher density of 
species in older climate space suggests that 
niche evolution has been truncated through 
time by the available climate on each continent, 
highlighting that niche conservatism limits 
exchange between Sunda and Sahul. 

Greater connectivity of a humid climate at 
the interface of the exchange explains the ob- 
served patterns of asymmetrical interchange 
in terrestrial vertebrates, and this was sup- 
ported by the mechanistic model. In the mod- 
el, evolution of climatic niches was directed by 
the environmental conditions where species 
evolved (fig. S7). Therefore, Sundanian line- 
ages evolved in niches closer to the condi- 
tions of Wallacea and colonized emergent 
Wallacean islands earlier (mean colonization = 
15.27 Ma; fig. S10). In the opposite direction, 
Sahul lineages were generally unable to suc- 
cessfully colonize proto-Wallacea. Coloniza- 
tion events by Sahulian species across Wallace’s 
Line generally occurred after the uplift of 
New Guinea’s highlands (mean colonization 
time = 5.4 Ma; fig. S10), supporting the role 
of New Guinea as an ecological “stepping 
stone” between Wallacea and the Australian 
mainland (40). 

We evaluated how climate connectivity ex- 
plains exchange asymmetry by performing 
an in-silico counter-factual simulation experi- 
ment. Here, we either removed the precipi- 
tation or temperature constraints on species 
distributions or removed the tectonic history 
by holding the landscape constant with sea 
levels at the time period with maximum con- 
nectivity between the regions (Last Glacial 
Maximum, ~20,000 years ago). This effectively 
allowed dispersal within each continental re- 
gion to be unconstrained by oceanic barriers 
and provided equivalent geographic isolation 
of each continental region from Wallacea. We 
found that the proportion of Sahulian species 
in Wallacea and New Guinea increased only 
when both the precipitation constraint and 
tectonic history were modified together, be- 
cause this promoted more opportunities for 
dispersal and a higher likelihood of establish- 
ment (Fig. 5 and figs. S11 and S12). The propor- 
tion of simulations with colonization events 
between Sahul and Sunda was 16.6% in the 
full model, which was similar to the observed 
proportion in terrestrial vertebrates (24%). 
The proportion increased to 48.6% in the 
counterfactual model (fig. S11), which is more 
similar to the observed proportion of line- 
ages moving in the opposite direction, from 
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Fig. 5. Spatial distribution of the proportion of terrestrial vertebrate 
species of Sahul origin and biogeographic turnover predictions from the 
biodiversity simulation model. Proportion of total species richness in each 
grid cell that belongs to families of Sahul origin, based on biogeographic 
estimation, over the count of all species in the grid cell, for all terrestrial 
vertebrates (A), birds (B), amphibians (C), mammals (D), and reptiles 

(E). Dashed lines in (A) show the location of the sharpest spatial turnover 

of proportional Sahulian richness under six different simulations models 

(M1 to M6), as well as under high (M1H) and low (MIL) values of precipitation 


Sunda to Sahul (43%). Therefore, more sym- 
metrical exchange could only be observed in 
the absence of climatic and dispersal bar- 
riers. A similar climate-mediated scenario has 
been suggested to explain exchange asym- 
metry during the American interchange (9, 41) 
and between India and Eurasia (7, 42). Our 
study clarifies how distinct climatic histories 
of continental regions can generate a near- 
ubiquitous pattern of asymmetrical terres- 
trial biotic interchanges. 


Dispersal and environmental niche shape 
spatial patterns of faunal exchange 


Patterns of turnover among taxa have long 
been a source of contention for designating bio- 
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geographic boundaries in the Indo-Australian 
archipelago (15) because there is a high dis- 
cordance in the distributional extent of differ- 
ent taxa across the Indo-Australian archipelago. 
Across the four classes of terrestrial vertebrates, 
gradients of turnover from a predominantly 
Sundanian to a predominantly Sahulian fauna 
differed, with a high proportion of sites in 
Sahul having high richness of Sundanian rep- 
tile species (100% of sites; Fig. 5E) and mam- 
mal species (68.8%; Fig. 5D) (which is driven 
by the exceptional diversity of Sundanian 
squamates, rodents, and bats), but not for 
amphibian species (11%; Fig. 5C) or bird spe- 
cies (1.4%; Fig. 5B). In the mechanistic model, 
a combination of dispersal and niche traits 


T 
150°E 


niche breadth and dispersal ability parameters of the full model (M1). The 
precipitation constraint on species distributions is removed in M2, the 
temperature constraint is removed in M3, tectonic history is removed in M4, 
both the precipitation constraint and tectonic history are removed in M5, 
and both the temperature constraint and tectonic history are removed in M6. 
The dashed lines illustrate how removing certain constraints that limit 
colonization changes the distribution of species from families with either 
Sahulian or Sundanian ancestry. Grid cell size is 110 x 110 km for all panels 


shaped gradients in biotic turnover, which 
could explain the emergence of these differ- 
ent patterns in vertebrates. When simulations 
were run with increased species’ precipitation 
niche breadth and dispersal ability parame- 
ters, we found that lines demarcating turn- 
over from a Sundanian to Sahulian biota (Fig. 
5A) were located southward into regions of 
increased aridity, varying from a pattern corre- 
sponding with amphibians (Spearman’s p = 
0.85 to 0.87; Fig. 5C) to a pattern correspond- 
ing with mammals (p = 0.71 to 0.74; Fig. 5D), 
respectively. For birds, New Guinea is predom- 
inantly Sahulian, and this pattern was only 
reproduced by completely lifting the dispersal 
and precipitation constraints from the model 
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(p = 0.96; Fig. 5B and fig. S11). This could be 
because birds have the broadest precipitation 
tolerances and the highest volancy of any class 
of vertebrate, and therefore have fewer con- 
straints on colonization (fig. S13). Because the 
simulation model is agnostic to the taxa in- 
vestigated, we expect a similar mechanism to 
operate across a range of taxa not explored 
here, and there is some evidence that niche 
and dispersal traits are important drivers of 
interchange and patterns of turnover in plants 
(10, 43). 


Conclusion 


The longstanding view of biotic interchange 
is that it is primarily governed by plate tec- 
tonics and dispersal ability, with rates of colo- 
nization in terrestrial organisms being directly 
proportional to the source pool size (J), the 
geographic distance between regions, and the 
emergence of land bridges (9). A more com- 
plete picture emerges when the deep-time 
legacy of climate connectivity and niche con- 
servatism on species distributions are also 
considered. Advances in deep-time climate re- 
constructions make it possible to test this 
directly, and support for a key role of climate 
connectivity is emerging in the Indo-Australian 
archipelago (33) and more broadly in other 
systems such as Indo-Eurasia (7, 42) and the 
American interchange (41). The factors that 
shape colonization success are relevant in 
both a historical and a contemporary context, 
because biotic exchange is exacerbated by 
human intervention, and understanding these 
factors is therefore relevant for predicting 
the success of new biological colonizations 
and invasions (35). This is particularly true 
in the Indo-Australian archipelago, where bio- 
logical ivasions are contributing to some of the 
highest rates of native species extinction on 
the planet (45). An emerging view of global 
biodiversity patterns is that they are largely 
the outcome of historical changes in plate 
tectonics shuffling lineages around the planet, 
in concert with major fluctuations in precip- 
itation and temperature shaping dispersal, 
speciation, and extinction dynamics through 
environmental niches (46, 47). A mechanistic 
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modeling approach enables us to move be- 
yond “lines” in biogeography and instead con- 
sider the processes that shape global variation 
in biodiversity patterns. 
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Chromatin expansion microscopy reveals nanoscale 
organization of transcription and chromatin 
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Nanoscale chromatin organization regulates gene expression. Although chromatin is notably 
reprogrammed during zygotic genome activation (ZGA), the organization of chromatin regulatory 
factors during this universal process remains unclear. In this work, we developed chromatin expansion 
microscopy (ChromExM) to visualize chromatin, transcription, and transcription factors in vivo. 
ChromExM of embryos during ZGA revealed how the pioneer factor Nanog interacts with 
nucleosomes and RNA polymerase II (Pol II), providing direct visualization of transcriptional 
elongation as string-like nanostructures. Blocking elongation led to more Pol Il particles clustered 
around Nanog, with Pol II stalled at promoters and Nanog-bound enhancers. This led to a new model 
termed "kiss and kick", in which enhancer—promoter contacts are transient and released by 
transcriptional elongation. Our results demonstrate that ChromExM is broadly applicable to study 


nanoscale nuclear organization. 


pon fertilization, embryos undergo tran- 

scriptional and cellular reprogramming 

to form a totipotent zygote (1-5). This 

reprogramming results in the transcrip- 

tional activation of genes required to 
initiate zygotic development. During this pro- 
cess, pioneer factors open the chromatin, re- 
cruit RNA polymerase II (Pol II), and activate 
transcription (6-9). However, the molecular 
organization of these factors during genome 
activation remains unclear. Nanoscale visu- 
alization of transcription and chromatin 
with current super-resolution approaches is 
limited by their ability to analyze nanoscale 
structure at the whole-nucleus scale, particu- 
larly in vivo. Alternatively, chromatin electron 
microscopy and tomography (ChromEMT) (70) 
provides nanometer resolution but lacks the 
multilabel imaging required to identify spe- 
cific regulatory interactions. To overcome these 
limitations, we combined pan-expansion mi- 
croscopy (pan-ExM) (17) with multimodal pro- 
tein, RNA, and DNA labeling to resolve the 
nanoscale organization of chromatin, tran- 
scriptional activators, and transcription in a 
method that we term chromatin expansion mi- 
croscopy (ChromExM). We applied ChromExM 
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to uncover how the pioneer factor Nanog in- 
teracts with chromatin and recruits Pol II to 
activate transcription during genome repro- 
gramming after fertilization. 


Nanog forms DNA-bound foci 
associated with Pol Il recruitment 
and transcription 


In zebrafish, Nanog is required for genome re- 
programming and transcriptional activation 
after fertilization (8, 9, 12, 13). To investigate 
the molecular organization of transcriptional 
activation during genome activation, we ana- 
lyzed the spatial organization of Nanog from 
the 32- to the 512-cell stage when transcription 
is activated. Live imaging of Nanog using a 
LlamaTag (/4) revealed that Nanog formed 
clusters associated with the very early tran- 
scription of microRNA-430 (miR-430) (47% 
of miR-430 transcription sites contain Nanog 
foci) (Fig. 1, A and B; fig. S1, A to C) (15-18). 
Quantification of Nanog fluorescence revealed 
that these foci increased the local Nanog con- 
centration 2.5- to 4-fold (fig. S1B), which is con- 
sistent with the occurrence of transcription 
factor and coactivator foci or hubs observed 
during embryogenesis and in cultured cells 
(19-28). We also observed that Nanog foci 
were associated with Pol II elongation, which 
is consistent with previous observations 
(6, 17, 22, 25, 27-30). Within a cell cycle, the 
formation of Nanog foci preceded Pol II elon- 
gation (Fig. 1, C and D), as visualized with a 
genetically encoded mintbody that detects elon- 
gating Pol II (Pol II pSer2) (25). After Pol II 
elongation begins, Nanog foci become less in- 
tense, which suggests that Nanog clusters may 
be evicted or dispersed after initiating tran- 
scription. In maternal-zygotic Nanog mutants 
(MZnanog’ ~) (8), Pol II foci are lost and overall 
levels are reduced at the 256-cell stage, which 


mt. 
confirms Nanog’s role in Pol II recruitment oot 
transcriptional activation (fig. S1, D and E) (--,-— 
Mutating the Nanog homeodomain to prevent 
DNA binding abolished Nanog foci, which indi- 
cates that they form in a DNA-binding-dependent 
manner rather than strictly through protein- 
protein interactions (fig. S1, F to I). Thus, Nanog 
forms DNA-bound hubs that increase its local 
concentration and are associated with Pol I 
recruitment and transcriptional activation dur- 
ing genome reprogramming. 


ChromExM achieves single-nucleosome 
resolution while preserving chromatin 
organization across scales 


Next, we aimed to determine the underlying 
nanostructure and molecular organization of 
Nanog and RNA Pol II during chromatin re- 
programming and transcriptional activation. 
To this end, we adapted the concept of pan- 
ExM (J) to include metabolic labeling of DNA 
and nascent RNA, along with antibody label- 
ing to visualize the chromatin with nanometer 
resolution, hereafter termed ChromExM (Fig. 
1, E and F). ChromExM involves the direct 
anchoring of biomolecules to a swellable hy- 
drogel through the addition of acryloyl groups, 
which allows protein and nucleic acid reten- ‘ 
tion during expansion (J7). We achieved an 
average ~15x linear expansion factor of the 
nuclei in embryos (Fig. 1G), which corre- 
sponded to a ~4000-fold increase in nuclear + 
volume and provided ~15-nm lateral resolu- 
tion on a confocal microscope and ~3 nm in 
stimulated emission depletion (STED) super- 
resolution microscopy (Fig. 2). 

Previous studies have reported conflicting 
results regarding the isotropy of ~4x to 8x 
expansion of chromatin (3/, 32). To address 
whether chromatin structure is perturbed by 
physical expansion, we developed an assay in 
which a pattern of parallel stripes is photo- + 
cleaved into biotin-labeled DNA before expan- ‘ 
sion and visualized after expansion (Fig. 2A; 
fig. S2, Aand B; and materials and methods). 

If chromatin expands isotropically, then the . 
relative position of chromatin chains should 
be preserved, and the photocleaved stripes will 
remain parallel after expansion. We observed 
that stripes generated before expansion re- 
main parallel after expansion and show no 
significant variation in their spacing when 
compared with simulated perfectly straight 
stripes generated as controls (Fig. 2, B and 
C; fig. S2, C and D; and materials and meth- 
ods). This indicates that chromatin expands 
isotropically and the relative spatial organi- 
zation of chromatin is preserved across the 
nucleus at submicron to global length scales 
by ChromExM (Fig. 2, B and C). 

The preservation of chromatin structure at 
the nucleosomal level depends, in part, on the 
mesh size of the hydrogel, which affects the 
positional uncertainty and anchoring frequency 
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Fig. 1. Nanog forms DNA-bound foci associated with zygotic transcription. 

(A) Schematic that details how Nanog and miR-430 transcription were visualized in 
living embryos. eGFP, enhanced green fluorescent protein. (B) Live imaging shows 
Nanog foci associated with miR-430 transcription at 2.5 hpf. n = 10 nuclei. (C) Live 
imaging shows Nanog foci are formed and disappear before Pol II elongation at = 2.75 hpf. 
n = 2 nuclei, 1 embryo. (D) Quantification of fluorescence intensity in (C). a.u., arbitrary 
unit. (E) Schematic that shows the process of ChromExM and expected results. AA, 
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acrylamide; BIS, N,N'-methylenebis(acrylamide); DHEBA, N,N'-(1,2-dihydroxyethylene) 
bisacrylamide; EU, 5-ethynyl uridine; SA, sodium acrylate. (F) Unexpanded and expanded 
nuclei from two unrelated embryos stained for DNA and imaged with a 10x 0.3 numerical 
aperture (NA) objective demonstrate the enhanced resolution provided by ChromExM. 
Scale bar is not corrected for the expansion factor. (G) Quantification of the nuclear 
expansion factor determined by measuring the nuclear cross-sectional area. n = 171 


unexpanded nuclei from 3 embryos and 104 expanded nuclei from 17 embryos. 
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Fig. 2. ChromExM pre- 
serves chromatin 
architecture and resolves 
chromatin fibers and 
individual nucleosomes. 
(A) Schematic that shows 
how the chromatin is 
painted with photocleaved 
(PC) stripes to detect 
perturbations after expan- 
sion. (B) Expanded nucleus 
that shows that PC stripes 
remain parallel and sharp 
after expansion. The image 
is a maximum intensity 
projection of several 
z-slices. n = 20 nuclei from 
3 embryos. (C) Quantifica- 
tion of relative interstripe 
distance in PC stripes versus 
a simulated control 
(methods) shows minimal 
variation in the spacing 
between stripes after 
expansion. n = 3 nuclei from 
3 embryos. (D) Schematic of 
in vitro-assembled nucleo- 
some arrays. (E) ChromExM 
image (left) that shows 
nucleosome arrays with 

H3 staining and EM image 
of nucleosome arrays with a 
similar conformation as the 
expanded array. Red scale 
bar is not corrected for the 
expansion factor. (F) Chro- 
mExM image with metabolic 
DNA labeling at 2.75 hpf 
that shows individual chro- 
matin fibers and a schematic 
of chromatin. n = 6 nuclei, 
5 embryos. (G) H3 staining 
imaged with confocal and 
STED microscopy resolves 
individual nucleosomes. 

n = 468 nucleosomes; 

3 nuclei from 2 embryos. 
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of biomolecules to the gel (33). To determine 
whether chromatin structure is maintained at 
the nucleosome scale, we measured the mesh 
size of our swellable hydrogel by assessing the 
mobility of differently sized molecules in the 
gel using fluorescence recovery after photo- 
bleaching (fig. $3, A to E) (34). We observed 
that a 3000 molecular weight (MW) dextran 
recovered quickly after photobleaching with 
a diffusion coefficient of 0.72 um?/s, but his- 
tone H1 and a 2,000,000 MW dextran did not 
recover (fig. S3, A to E). These results demon- 
strate that the mesh size is small enough to 
prevent rapid diffusion of the ~32-kDa H1 
protein, which suggests that the gel polymer 
can anchor the chromatin with subnucleoso- 
mal resolution. To functionally test this, we 
compared ChromExM and electron micros- 
copy (EM) of in vitro-assembled nucleosome 
arrays containing 13 nucleosomes along 2.7 kb 
of DNA (Fig. 2, D and E; fig. $3, F and G) (35). 
We observed a similar organization of nucleo- 
some arrays in ChromExM and unexpanded 
EM, with an average of 12.7 and 12.5 nucleo- 
somes detected per array, respectively (Fig. 2E 
and fig. $3, F to H). Taken together, these 
results demonstrate that the local chromatin 
organization can be maintained at the nucleo- 
somal scale during ChromExM. 

To visualize chromatin, we metabolically 
labeled the DNA with (2’S)-2'-deoxy-2’-fluoro- 
5-ethynyluridine (f-ara-EdU) (36), followed by 
fluorescent picolyl azide detection after expan- 
sion (37). This approach was more photostable 
than the intercalating dye SYTOX Green and 
increased the labeling intensity ~45-fold com- 
pared with standard azides (fig. S3, J to L), 
which improved chromatin labeling to over- 
come the effects of molecular decrowding and 
reduced brightness caused by the ~4,000-fold 
volumetric expansion. This approach resolved 
chromatin fibers in the nucleus (Fig. 2F) with 
similar diameters (<12 to 40 nm) to those de- 
termined by EM (0). Nucleosomes are ~10 nm 
in size, which is at the resolution limit of con- 
focal microscopy after ChromExM. There- 
fore, we performed STED microscopy with 
ChromExM to improve the resolution by an 
additional factor of approximately five, which 
approached <3-nm lateral resolution, and 
demonstrated that ChromExM can resolve 
individual nucleosomes by using histone H3 
immunostaining and confocal microscopy 
(~80% of nucleosomes detected by confocal 
are individual nucleosomes by STED) (Fig. 2G 
and fig. S31). The combination of ChromExM 
with STED provides fast multimodal molecu- 
lar imaging of the chromatin that approaches 
<3-nm resolution. 


Visualizing interactions between nucleosomes 
and the pioneer factor Nanog 


We used ChromExM to determine the under- 
lying nanoscale organization of transcription 
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hubs during genome activation by visualizing 
the molecular-scale interactions among the 
pioneer factor Nanog (8, 9), nucleosomes, and 
Pol II at 4 hours post fertilization (hpf). To 
analyze how Nanog interacts with chromatin, 
we costained for Nanog and H3 and visualized 
an average of 88,619 and 104,347 individual 
particles of Nanog and H3 per nucleus, respec- 
tively, with a false-detection rate of 0.05% for 
identifying Nanog particles (Fig. 3, A to D; fig. 
S4, A to G; and movie S1). Quantitative analy- 
sis of the spatial organization of Nanog and 
H3 particles revealed several classes of Nanog- 
nucleosome conformations (Fig. 3, B to G; fig. 
S4, A to E and H to I; and materials and meth- 
ods). In the first class, Nanog was closely asso- 
ciated with nucleosomes (<20 nm distance 
between Nanog and H3), with 2.0% of H3 and 
3.3% of Nanog particles falling into this group 
(Fig. 3, B and E, and fig. S4A), which likely 
indicates a bound state in which Nanog is po- 
tentially initiating chromatin opening. In the 
second class, a subset of class 1, Nanog clusters 
were bound to a nucleosome (>1 Nanog par- 
ticle within 20 nm and additional Nanog 
particles within 50 nm), which represents re- 
cruitment of multiple Nanog particles to the 
same chromatin region (0.06% of H3, 0.2% of 
Nanog) (Fig. 3, C and F, and fig. S4, B and B’). 
The third class contained Nanog particles dis- 
tantly associated with nucleosomes (20 to 
100 nm away from each other) (26.9% of H3, 
39.7% of Nanog), which potentially indicates 
chromatin regions already opened by Nanog 
(Fig. 3, D and G, and fig. S4C). The fourth 
class contained nucleosomes and Nanog not 
closely associated with each other (>100 nm 
away) (71.2% of H3, 57.0% of Nanog) (fig. S4, D 
and E). Our analysis identified 88,619 Nanog 
particles per nucleus, which is consistent with 
the number of chromatin immunoprecipita- 
tion followed by sequencing (ChIP-seq) peaks 
(~40,000) identified at a similar stage in zebra- 
fish (8, 38). These measurements are likely the 
lower bounds of Nanog-nucleosome interac- 
tions considering that some nucleosomes may 
not be detected here. Taken together, we are 
able to characterize pioneer factor-nucleosome 
organization during chromatin opening. 


RNA Pol Il forms string nanostructures 
associated with nascent RNA 


To understand how transcription is organized 
during chromatin reprogramming, we used 
ChromExM to visualize Nanog and Pol II phos- 
phoserine 5 (pSer5), which is deposited on Pol II 
after recruitment to the promoter (Fig. 3, H to 
K; fig. $5, A to C; and movie S2) (39-41). We 
detected an average of 27,503 Pol II particles 
per nucleus, which have characteristics con- 
sistent with single-molecule detection on the 
basis of their size and homogeneous intensity 
(fig. S5, D to G). Globally, the average distance 
between Pol II pSer5 and Nanog particles was 


94 nm (Fig. 3L), with 19% of Pol II pSer5 par- 
ticles being within 50 nm of Nanog particles, 
consistent with Nanog’s role in Pol II recruit- 
ment (8). However, we observed that Pol II 
exhibited three distinct types of organization 
(Fig. 3, I to K, and fig. S5, A, B, C, and H). The 
first class involved large groupings of inter- 
spersed Pol II pSer5 and Nanog particles (412 
Pol II pSer5 particles and 100 Nanog particles 
on average, 132-um mean length) arranged 
like beads on a string, which we refer to as 
class 1 strings and occurred twice per nucleus 
in 86% of nuclei observed (Fig. 3, I and M to O; 
fig. S5A). RNA fluorescence in situ hybridiza- 
tion (RNA FISH) for miR-430 revealed that 
class 1 strings represent sites of active miR-430 
transcription (Fig. 3, P and Q; fig. S6, A to C). 
To validate this result, we assembled the high- 
ly repetitive miR-430 locus using single hap- 
loid embryo long-read genome sequencing 
and identified a single, continuous ~550-kb | 
miR-430 locus (fig. $7, A to F), which corre- 
sponded to an estimated length of ~200 um, 
which is consistent with the 132-um length of 
class 1 strings (Fig. 30). This locus has been 
estimated to have >300 promoters capable of 
transcribing primary miR-430 transcripts (42), 
which is consistent with the ~1800 mature 
miR-430 genes detected in our assembly (fig. 
87, C and D). Similarly, we detect ~400 Pol II 
pSer5 particles in class 1 strings, which likely 
represent binding at these promoters (Fig. 
3M). The string-like structure that connects 
Pol II and Nanog particles is consistent with 
Pol II and Nanog binding profiles observed 
by ChIP-seq at this locus (fig. S7, G and H) 
(8, 12). Additionally, miR-430 RNA FISH sig- 
nal and class 1 strings are lost in miR-430'~ 
embryos (figs. S6, A and B, and S8, A and B) 
(43), which we confirmed lack all miR-430 
genes using long-read genome sequencing 
(fig. S7, C to F). Together, these results de- 
monstrate that class 1 strings show the nano- 
scale organization of the transcriptionally 
active miR-430 cluster (6, 17). 

The second class of Pol II pSer5 organiza- 
tion involved multiple Pol II particles orga- 
nized into strings with an average length of — 
831 nm (2 to 70 particles; mean of 4 particles 
per string) (Fig. 3, J and M to O; fig. S5, B and 
H). In this class, 37% of Pol II pSer5 particles 
were within 50 nm of Nanog particles, con- 
sistent with Nanog’s role in Pol II recruit- 
ment. We hypothesized that these linear 
class 2 Pol II pSer5 strings represent indi- 
vidual genes loaded with Pol II arranged in 
single-file lines. Class 2 strings are still present 
in miR-430/~ embryos (fig. S8, B and C), 
which indicates that they represent other tran- 
scribed genes (8). This processive organization 
of Pol II is reminiscent of active transcrip- 
tional elongation. In other cases, we observed 
branched Pol II pSer5 strings (Fig. 3J and fig. 
S5, B and H), which is consistent with the 
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transcription factory model in which multiple 
genes are in close proximity and share a reg- 
ulatory Pol II hub (29, 44, 45). Indeed, we de- 
tected an average of 822 Pol II pSer5 strings 
per nucleus, which is consistent with the ~1700 
active zygotic genes at 4 hpf we previously 
identified (17). 

The third class of Pol II organization in- 
volved “macroclusters” (Fig. 3K and fig. S5C) 
similar to those previously observed by super- 
resolution imaging (20-22). We detected an 
average of 28 Pol II pSer5 macroclusters per 
nucleus. In cases in which these macroclusters 
were occupied by both Nanog and Pol II, we 
were able to resolve distinct regions within the 
cluster occupied solely by Nanog and Pol II 
(Fig. 3K and fig. S5C), as if the two factors were 
tethered on distinct DNA elements. These ex- 
amples are consistent with Nanog bound to 
enhancers that reside in close proximity 
(<50 nm) to Pol II bound at the promoter 
(8, 38). Similarly, we observe all three classes 
of Pol II pSer5 organization during the minor 
wave of genome activation at 2.5 hpf, although 
there were fewer active sites, which is con- 
sistent with lower levels of transcription at 
this time (fig. S9, A to H) (16, 17, 46). This sug- 
gests that each class of Pol II organization 
represents a generalizable state of Pol II orga- 
nization broadly used throughout genome ac- 
tivation. At 4 hpf, we observed that there are 
an average of 5.3 Pol II particles within 200 nm 
of each Nanog particle among all three classes, 
which suggests that multiple Pol IIs are re- 
cruited by Nanog. Among the macroclusters, 
we also observed multiple cases in which a 
string of RNA Pol II emanated from the cluster 
(Fig. 3K), which raises the question of how 
these two distinct Pol II pSer5 structures may 
function. 

We hypothesized that Pol II pSer5 strings 
represent actively transcribing Pol II, which 
has exited the shared regulatory hub where 
Nanog, Pol II, and potentially other coactiva- 
tors congregate to control gene expression. 
To test this, we combined metabolic RNA label- 
ing using 5-ethynyl uridine with ChromExM 
labeling for Pol II pSer5 and observed nascent 
transcripts associated with all three classes of 
Pol II pSer5 structures (Fig. 3, R to U; fig. S10, 
A to F; and movie S3). We observed a wide 
range of transcription levels associated with 
the Pol II pSer5 strings, which suggests that 
they may encompass actively transcribed gene 
bodies as well as paused promoters (fig. S1OG). 
The organization of these transcriptionally ac- 
tive Pol II pSer5 strings is consistent with 
Pol II elongation along the gene body of indi- 
vidual genes, for which we visualized multiple 
Pol II pSer5 particles that extrude nascent 
transcripts in concert (Fig. 3, R to T; fig. S10, 
A, B, C, E, and F). Taken together, these re- 
sults support a model in which Nanog and 
Pol II clusters are in close physical proximity 
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and form a regulatory hub that activates tran- 
scription as Pol II exits the hub to form a string 
along the gene body (Fig. 3R and fig. S10A). 


Enhancers and promoters are kicked apart by 
transcription elongation 


Next, we assessed whether these Pol II pSer5 
structures are formed in a transcription- 
dependent manner (Fig. 4, A to H; fig. S11, A 
to G; and movie S4). Inhibiting transcription 
elongation with o-amanitin (47-50) led to a 
strong reduction of Pol II pSer5 string length 
(66% reduction; P = 0.002) (Fig. 4, C, D, G, H, 
and J) and a mild reduction in the size and 
number of Pol II macroclusters (26.3% reduc- 
tion in number; P = 0.0304) (Fig. 4J and fig. 
S11H) without reducing Pol II particle count 
(fig. S111). Taken together, these data indicate 
that Pol II pSer5 strings represent transcrip- 
tion elongation, whereas Pol II pSer5 macro- 
clusters may function as a regulatory hub 
formed independently of elongation. 

We also observed that Nanog and Pol II 
particles were closer after treatment with 
a-amanitin (Fig. 4K; median distance 106 nm 
versus 66 nm; P < 0.001), with more Pol II 
pSer5 particles surrounding each Nanog par- 
ticle (Fig. 4L; 3.1 versus 7.6 Pol II per Nanog; 
P < 0.001). Similar behavior between Nanog 
and Pol II was observed when transcription 
was inhibited in miR-430/~ embryos (fig. SID), 
which suggests that these changes occur at 
multiple loci in the genome. We explain these 
results with the following model: Nanog first 
recruits Pol II to enhancer-promoter hubs, 
which brings them into close proximity; then 
Pol II exits the regulatory hub in the form of 
Pol II pSer5 strings, which transcribes the gene 
body (Fig. 4). When transcription elongation 
was inhibited, we observed an increase in the 
stoichiometry and proximity of Nanog-Pol II 
particles, which is consistent with Pol II stall- 
ing at the promoter and the stabilization of 
enhancer and promoter interactions. 

To test this model, we performed ChIP-seq 
for Pol II pSer5 in wild-type and a-amanitin- 
treated embryos and observed extensive pileup 
of Pol II at promoters when blocking tran- 
scription (Fig. 4M and fig. S11K). As predicted 
from our ChromExM results, Pol II pSer5 ac- 
cumulated at Nanog-bound enhancers and 
other accessible sites (Fig. 4, N and O; fig. S11, 
Land M), yet these regions are not enriched 
for RNA Pol II in wild-type embryos (Fig. 4, N 
and O; fig. S11, L and M) (8). These results can 
be explained by a model in which enhancer- 
promoter contacts are transient, consistent 
with the kiss-and-run model (27), and there is, 
therefore, a larger mean distance between 
Nanog and Pol II (Fig. 4K). However, in the 
absence of transcriptional elongation, Pol II is 
continuously brought to the promoter and re- 
mains in close proximity to the enhancer as 
observed by ChIP-seq. This is further supported 


by the increase in the number of Pol II parti- 
cles that surround each Nanog particle (Fig. 
4L). On the basis of these data, we propose a 
modified version of the kiss-and-run model 
termed “kiss and kick,” in which transcrip- 
tion itself kicks away the enhancer from the 
promoter as Pol II elongates (Fig. 4P). This 
would explain why we observe a stabilization 
of Poll II in close proximity to Nanog-bound 
enhancers when elongation is inhibited. 


Discussion 


Here, we developed ChromExM for multimodal 
super-resolution chromatin imaging by phys- 
ically enlarging biological samples to achieve 
nucleosomal resolution. We demonstrate that 
chromatin organization is preserved and de- 
veloped an improved metabolic labeling strat- 
egy for chromatin imaging (Fig. 2). ChromExM 
provides markedly higher resolution than pre- 
vious ExM applications for chromatin imaging | 
(~3 to 15 nm versus ~65 nm) (37). Other meth- 
ods to visualize chromatin, such as ChromEMT 
(10), lack multimodal labeling, and the resolu- 
tion of single-molecule localization micros- 
copy is limited by the size of the fluorescent 
labels (~20 nm for primary and secondary anti- 
body), which becomes negligible in ChromExM 
given that labels are applied after expansion 
(1D). Although ChromExM provides substantial 
technical advances, it requires bright and photo- 
stable fluorescent labeling and chemical fix- 
ation, which may affect the appearance of 
certain structures. Future methods will be 
needed to identify specific loci through DNA 
FISH compatible with ChromExM to inves- 
tigate specific regulatory structures at indi- 
vidual genes. 

We used ChromExM to characterize the 
nanoscale organization of the pioneer factor 
Nanog and RNA Pol II as they activate tran- 
scription during genome activation. Although 
previous studies have shown how pioneer fac- 
tors direct chromatin opening in vitro (57, 52) 
and organize into hubs in vivo (19, 28, 53, 54), 
the nanoscale organization of such hubs has 
remained unclear. ChromExM revealed that 
Pol II shows three types of organization during 
genome activation and is intimately associated 
with Nanog (Fig. 3, H to K). Previous studies 
have shown that transcription factors (TFs) 
such as Oct4, Brd4, and Mediator form clus- 
ters associated with superenhancers (21, 23, 24). 
With the resolution of ChromExM, we can 
now resolve how Nanog and Pol II often oc- 
cupy distinct regions within transcription hubs, 
which is consistent with their binding at en- 
hancers and promoters that are in close contact. 

How enhancers and promoters are orga- 
nized to activate transcription is central to 
understanding gene regulation. Previous re- 
ports have concluded that enhancer-promoter 
contact is correlated (55), anticorrelated (56), 
or unrelated (57) with transcription, which 
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leaves the question of how enhancer-promoter 
contact is related to transcription unanswered. 
Three models have been proposed to explain 
how enhancers and promoters interact to 
control gene expression: (i) stable contact 
between enhancers and promoters, (ii) a dy- 
namic kissing model with transient enhancer- 
promoter contacts, and (iii) a TF activity 
gradient in which TFs diffuse from enhancers 
to promoters rather than requiring physical 
contact (58). Our results are not consistent 
with models (i) and (iii) and lead to a model in 
which Nanog can form clusters at enhancers, 
coming into physical proximity with Pol IT at 
the promoter and eventually triggering tran- 
scription, which we observed outside of this 
shared regulatory hub as strings of Pol II. 
Consistent with this model, we found that 
Pol II strings were substantially reduced when 
elongation was inhibited, which coincided with 
an accumulation of Pol II at promoters and 
in Nanog-bound enhancers and indicates that 
the enhancer and promoter were stabilized in 
close contact (Fig. 4, A to P). Taken together, 
these results suggest that Pol II elongation 
displaces enhancer-promoter contacts and 
lead us to propose the kiss-and-kick model. 
In this model, the dynamic association be- 
tween the enhancer and promoter is kicked 
apart during elongation. This effect can be 
caused by either transcription elongation or 
the nascent RNAs that have been shown to 
dissolve Mediator condensates (59). The kiss- 
and-kick model could also explain transcrip- 
tional bursting (60), as elongation would be 
triggered in intervals while the enhancer and 
promoter are in contact and then paused after 
elongation kicks away the enhancer. 
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It’s not just about the money 


don’t think industry would be a good fit for me,’ the student said, “I’m not money-motivated.” As an 

industry scientist myself, I struggled to hide my surprise. I was participating in a conference event 

to help graduate students and postdocs explore career options by connecting with professionals 

across industry, academia, government, and nonprofits. I love my work, and I was excited to share 

it with the attendees. Overall, they were enthusiastic to learn about their options, and some were 

seriously considering industry. I was glad to help them figure out whether it was right for them 
and, if so, how they could get there. So that student’s dismissive comment caught me off-guard—though 
I suspect others felt the same way. After pausing to regain my composure, I shared my own journey. 


When I was in graduate school, a 
relative I considered a surrogate 
grandmother and confidant, whom 
I spent holidays with and who even 
did a reading at my wedding, was 
diagnosed with late-stage ovarian 
cancer. She was hospitalized about 
an hour and a half away from her 
home and family—but near where 
I was living for graduate school. I 
visited her regularly and attended 
doctor’s appointments with her. I 
put together a comfort purse for 
her chemotherapy treatment, com- 
plete with ginger chews, a blanket 
I had crocheted, and other items to 
help with side effects and pass the 
time. I was there to see the bruises 
on her arms from constant needles, 
to helplessly watch her become 
less and less herself as the chemo- 
therapy ravaged her body and even- 
tually stopped working. When she was transferred to an 
assisted living facility, I stopped by on my drive home to see 
her and read to her. 

During one visit, she said it was too bad my Ph.D. re- 
search wasn’t about cancer. Instead, my doctorate was in 
toxicology: analyzing dust for allergens, bacteria, and fungi. 
Her comment struck me, but changing course didn’t seem 
a viable option. My expertise felt too far from cancer. Plus, 
I had chosen to study toxicology because it is broadly im- 
portant, marketable, and intellectually stimulating. Those 
reasons still seemed compelling, and the thought of aban- 
doning my carefully laid plans because of an emotional per- 
sonal experience felt unprofessional. 

Eventually, she was transferred to hospice closer to her 
hometown, where she passed peacefully. And I went on to 
do my postdoc in a pharmacology and toxicology labora- 
tory, assessing the impact of ozone exposure and dietary 
omega-3 fatty acids on pulmonary inflammation in mice. 


“The majority of industry 
scientists ... value the 
contributions they make.” 


It was during my postdoc that I 
first began to explore nonacademic 
career options, thanks in large part 
to my amazing, supportive adviser, 
who encouraged me to attend ca- 
reer development symposiums 
and helped me make connections 
and network. That’s when pursu- 
ing drug development came on 
my radar. But I still hesitated to 
make the transition—until a gradu- 
ate student asked me to proofread 
her award application. The essay 
explained that she wanted to get 
into drug development because of a 
loved one’s illness. Her essay moved 
me and showed me a new way my 
career could have meaning. 

In the 3.5 years I have been 
working in industry, I have found 
that meaning. The transition from 
academia has been an adjustment 
and a learning journey, but industry is a great fit for me. 
I now work for a company developing cancer treatments, 
and I am proud of the contributions I am making to help 
patients. I think of the dear relative I lost often and for me, 
that is motivating. 

Contrary to what the student at the conference as- 
sumed, working in industry isn’t all about the money for 
me. Of course making a living is important, and there’s 
nothing wrong with financial considerations playing a 
role in career decisions. But the majority of industry sci- 
entists I know aren’t doing it only for the money; they 
enjoy their jobs and value the contributions they make. 
And many of them, like me, have come to this work with 
a deeply personal motivation. And there’s nothing wrong 
with that either. & 


Brita Kilburg-Basnyat works in oncology drug development in 
Wisconsin. Send your career story to SciCareerEditor@aaas.org. 
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